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BIOLOGICAL CHIPS The methods of the invention can be used to detect varia- 

tions between a target and reference sequence, including 
CROSS-REFERENCE TO RELATED single or multiple base subsdtutions, and deletions and 

APPLICATION . . .. 5 insertions of bases, as well as detecting the presence, 

Tu'^' Ji- *• c I- a- \r '>,rt;, location, and sequence of other more consplcx variations 

■ ■ f of W Of application Scr. No. 08A43,312. between a target and reference" sequence in i nucleic acid. • 
filed Oct 26, 1993, now abandoned,^ The present invention provides arrays of oligonucleotide 

f ^''i nnl"-^' plication Sen No. 082.937, filed 25 p^bes immobilized on a solid support. Tlie arrays arc 
Jun. 1993, now abandoned, mcorporated herein by refer- preferably synthesized directly on the support using 

. . VLSIPSyw technology, but other synthesis methods and 

Research leading to the invention was funded in part by immobilization of pre -synthesized oligonucleotide probes 
NIH grant No. 1R01HG008 13-01 and DOE grant No. can be used to make the oligonucleotide probe arrays, called 
DE-FG03-92-ER81275, and the government may have ccr- "DNA chips", of the invention. In general, these arrays 
tain rights to the invention. comprise a set of oligonucleotide probes such that, for each 

" base in a specific reference sequence, the set includes a 

BACKGROUND OF THE INVENTION p robe (called the" wild-type" or" WT* probe) that is exactly 

T . complementary to a section of the reference sequence 

1. t-ield of the Invention including the base of interest and four additional probes 
The present invendon provides arrays of oligonucleotide (called "substitution probes"), which are identical to the WT 

probes immobilized in microfabricated patterns on silica 20 probe except that the base of interest has been replaced by 
chips for analyzing molecular interactions of biological oneof a predetermined set (typically 4) of nucleotides. In the 
interest. The invention therefore relates to diverse fields prefcrredembodiment, oneof the four substitution probes is 
impacted by the nature of molecular interaction, including identical to the wild type probe; the other three arc comple- 
chemistry, biology, medicine, and medical diagnostics. mentary to targeU that have a single-base substitution at this 

2. Description of Related Art 25 position. ' . . 

-I .'J i_ . 1 . , , In another aspect, the mvention relates to the arrancemeat 

OLgonuc leoude probes have long been used to detect j^^ividual probes in the array. In one embodiment, the 

complementary nucleic aad sequences m a nucleic acid of p^bes are arranged on the chip so that probes for a given 

interest (the target nucleic acid). In some assay formats, position in the sequence are adjacent, and probes for adja- 

the oligonucleotide probe is tethered, i.e., by covalent cent positions in the reference sequence are also adjacent to 

attachment, to a solid support, and arrays of oligonucleotide one another on the chip. One method arranges the probes for 

probes imrnobilized on solid supports have been used to a single base in a short - column (alternately .row) and ; 

detect specific nucleic acid sequences in a target nucleic ' arranges the columns in the order of the base position to 

acid. Sec, e.g., PCT patent publication Nos. WO 89/10977 form horizontal (altcraatcly vertical) stripes. The wild-type 

and 89/11548. Others have proposed the use of large num- - and each of the substitution probes have specified positions 

bers of oligonucleotide probes to provide the complete within the column so that all the probes corresponding to an 

nucleic acid sequence of a target nucleic but failed to A substitution, for example, are in a single row. The stripes 

provide an cnabhng method for using arrays of immobilized separated on the chip by a blank row or column, 

probes for this purpose. See U.S. Pat. Nos. 5,202,231 and Th® Dl^A chips of the invention can be made in a wide 

5,002,867 and PCT patent publication No WO 93A7126 number of variations. For some applications, leaving out the 

The development of VLSIPS™ technology has provided" y^d-^P^ ^o^, leaving out um^^ 

methods for making very large arrays of obgonucleotide "it^^ilf S f P^^^^^' varying the length 

probes in very small arra^i. Sel U.S. Pat. No. 5:143.854 and ^^^^ " Tf"" 

PCT patent publication Nos. WO 90/15070 and 92/10092, 7 f'^T % ° '"""^'^ """^"l^"? 

each of which is incorporated hereb by reference. U.S "^^7' "^"f the mutaUon posiUon, using mulaple 

patent application Sen No. 082,937. filed 7un. 25, 1993 ^^^^ ".'^t,' l^ ^^^l'^^^^ P^^^ 

5^/.„i,-r\»-n,»j- r_ _ f c I- 1 .-J arrays, placing blank "sUects" (no probe) between rows, 

Sfh, n ^, ^ , °^ oygon«deot.de ^^j^.^*^ j^^^j^^j and using control probes may 

probes that can be used to provide the complete sequence of be aDorooriate *^ ' & r j 

a Urgel nucleic acid and to delect the presence of a nucleic r i -j txkta i.- r ^ 

acid containing a specific nucleotide ^quence. so - "^^Pj^''^^'^''^^^'?^^^^^^^ • 

\M* CL • . J • •? , , ^" mg mutations associated with cystic fibrosis, mcludmg 

Micrpfabncatcd arra^ o^^^^^ numbers of oligbnucle- ^^^^^^^^ ^ 4 7 ^ 20, and 21 of the CFTR^' 

otideprob«,cdled ;.DNAcbps-^ ' gene. TCe invention also provides DNA chips for iletecti^g 
wide variety of applications. New methods and reagents are ^^,^aons in the p53 gene, a gene in which muUtions are 
required to realize this promise, and the present mvention known to be associated with a wide variety of cancers. Other 

neips meet that need. 55 j^j^^ ^^^^^ invention provide probe arrays for dctecl- 

SUMMARY OF THE INVENTION specific sequences of mitochondrial DNA, useful for 

identification and forensic purposes. The invention also 
The present mvention provides methods for making high- provides DNA chips for detecting specific sequences of 
density arrays of oligonucleotide probes on silica chips and nucleotides or mutations associated with the acquisition of a 
for usmg those probe arrays to detect specific nucleic acid 60 jmg resistant phcnotype in an infectious organism, such as 
sequences contained in a target nucleic acid in a sample. The rffampicin or other drug resistant TB strains and HIV, in 
invention also provides arrays of oligonucleotide probes on vvhich muUtions in an RNA polymerase gene arc known to 
DNA chips, m which the probes have specific sequences and gjyc rise to drug resistance, 
locations in the array to facih'tate identification of a specific 

Urgct nucleic acid. In another aspect, the invention provides 6S ^^^^^ DESCRIFHON OF THE DRAWINGS 
methods for detecting whether one or more specific FIG. 1 shows how the tiling method of the invention 
sequences of a taigct nucleic acid in a sample varies from a defines a set of DNA probes relative to a target nucleic acid. 
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In the figure, the target is a DNA molecule, the probes are 
single-stranded nucleic acids 16 nucleotides in length, and 
only a portion of the probes defined by the method is shown. 

FIG. 2 shows an illustrative tiled array of the invention 
^ with probes for the detection of point mutations. The base at 
the position of substitution in each of the wild-type probes 
is shown in the wild-type lane, and the shading shows the 
location of the substitution probe having the wild-type 
sequence. The SEQ ID. NOS. corresponding to the two 
peptide sequences shown in the top portion of FIG. 2 are 311 
and 312, respectively. The SEQ ID. NOS. corresponding to 
the five peptide sequences listed at the bottoni of FIG. 2 are 
313, 314, 315, 313, and 316, respectively. 

FIG. 3, in panels A, B, and C, shows an image made from 
the region of a DNA chip containing CFTR cxon 10 probes; 
in panel A, the chip was hybridized to a wild-type target; in 
panel C, the chip was hybridized to a mutant AF508 target; 
and in panel B, the chip was hybridized to a mixture of the 
wild-type and mutant targets. The SEQ ID. NOS. corre- 
sponding to the four peptide sequences shown in FIG. 3 are 
317-320, respectively. 

FIG. 4, in sheets 1-3, corresponding to panels A, B, and 
C of FIG. 3, shows graphs of fluorescence intensity versus 
tiling position. The labels on the horizontal axis show the 
bases in the wild-type sequence corresponding to the posi- 
tion of substitution in the respective probes. Plotted are the 
intensities observed fi-om the features (or synthesis sites) 
contaim'ng wild-t>'pe probes, the features containing the 
substitution probes that bound the most target ("called"), and 
the feature containing the substitution probes that bound the 
• target with the second highest intensity of, all the substitution 
probes ("2nd Highest"). The SEQ ID. NOS. corresponding 
to the two peptide sequences shown in sheet 1 of FIG. 4 are 

321 and 318, respectively; the SEQ ID. NOS. corresponding 
to the two peptide sequences shown in sheet 2 of FIG. 4 are 

322 and 318, respectively; and the SEQ ID. NOS. corre- 
sponding to the two peptide sequences shown in sheet 3 of 
FIG. 4 are 323 and 318, respectively. 

FIG. 5, in panels A, B, and C, shows an image made firom 
a region of a DNA chip containing CFTR cxon 10 probes; 
in panel A, the chip was hybridized to the wt480 target; in 
panel C, the chip was hybridized to the mu480 target; and in 
panel B, the chip was hybridized to a mixture of the 
wild-type and mutant targets. The SEQ ID. NOS. corre- 
sponding to the peptide sequences shown in FIG. 5 are 
324-327, respectively. 

FIG, 6, in sheets 1-3, corresponding to panels A, B, and 
C of FIG. 5, shows graphs of fluorescence intensity versus 
' tiling position. The labeb on the horizontal axis show the 
bases in the wild-type sequence corresponding to the posU 
tioo of substitution in the respective probes: Plotted are the 
intensities observed from the features (or synthesis sites) 
containing wild-type probes, the features containing the 
substitution probes that bound the most target ("called"), and 
the feamre containing the substimtion probes that bound the 
target with the second highest intensity of all the substitution 
probes ("2nd Highest"). The SEQ ID. NOS. corresponding 
to the two peptide sequences shown in sheet 1 of FIG. 6 are 
328 and 329, respectively; the SEQ ID. NOS. corresponding 
to the two peptide sequences shown in sheet 2 of FIG. 6 are 
330 and 329, respectively; and the SEQ ID. NOS. corre- 
sponding to the two peptide sequences shown in sheet 3 of 
FIG. 6 are 331 and 329, respectively. 

FIG. 7, in panels A and B, shows an image made from a 
region of a DNA chip contaim'ng CFTR cxon 10 probes; in 
panel A, the chip was hybridized to nucleic acid derived 
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from the genomic DNA of an individual with wild-type 
AF508 sequences; in panel B, the target nucleic acid origi- 
nated from a heterozygous (with respect to the AF508 
mutation) individual. 

FIG. 8, in sheets 1 and 2, corresponding to panels A and 
B of FIG. 7, shows graphs of fluorescence intensity versus 
tiling position. The labels on the' horizontal axis show the 
bases in the wild-type sequence corresponding to the posi- 
tion of substitution in the respective probes. Plotted are the 
intensities observed firom the features (or synthesis sites) 
containing wild-type probes, the features containing the 
substitution probes that bound the most target ("called"), and 
the feature containing the substitution probes that bound the 
target with the second highest intensity of all the substitution 
probes ("2nd Highest"). The SEQ ID NOS. corresponding to 
the two peptide sequences shown in sheet 2 of FIG. 8 are 332 
and 318, respectively. 

FIG. 9 shows the human mitochondrial genome; "O^** is 
the H strand origin of replication, and arrows indicate the 
cloned unshaded sequence. 

FIG. 10 shows the image observed from application of a 
sample of mitochondrial DNA derived nucleic acid (from 
the mt4 sample) on a DNA chip. 

FIG. 11 is similar to FIG. 10 but shows the image 
observed from the mtS sample. 

FIG. 12 shows the predicted difference image between the 
mt4 and mt5 samples on the DNA chip based on mismatches 
between the two samples and the reference sequence. 

FIG. 13 shows the actual difference image observed for 
the mt4 and mt5 samples. • ' . - ; 

FIG. 14, in sheets 1 and 2, shows a plot of normalized 
intensities across rows 10 and 11 of the array and a tabula- 
tion of the mutations detected. 

FIG. 15 shows the discrimination between wild- type and 
mutant hybrids obtained with the chip. A median of the six 
normalized hybridization scores for each probe was taken; 
the graph plots the ratio of the median score to the normal- 
ized hybridization score versus mean counts. A ratio of 1.6 
and mean counts above 50 yield no false positives. 

FIG. 16 illustrates how the identity of the base mismatch 
may influence the ability to discriminate mutant and wild- 
type sequences more than the position of the mismatch 
within an oligonucleotide probe. The mismatch position is 
expressed as % of probe length from the 3 -end. The base 
change is indicated on the graph. 

FIG. 17 provides a 5' to 3' sequence listing of one target 
coaesponding to the probes on the chip. X is a control probe. 
Positions that differ in the target (i.e., are mismatched with 
the probe at the designated site) are in bold. The'SE,Q ID. 
NO. corresponding to the peptide, sequence shown in FIG. 
17 is 333. 

FIG. 18 shows the fluorescence image produced by scan- 
ning the chip described in FIG. 17 when hybridized to a 
sample. 

FIG. 19 illustrates the detection of 4 transitions in the 
target sequence relative to* the wild-type probes on the chip 
in RG. 18. 

FIG. 20 shows the alignment of some of the probes on a 
p^^ DNA chip with a 12-mer model target nucleic acid. The 
SEQ ID. NOS. corresponding to the fourteen peptide 
sequences shown in FIG. 20 are 334-347, respectively. 

FIG. 21 shows a set of 10-mer probes for a p53 cxon 6 
DNA chip. The SEQ ID. NOS. corresponding to the thirteen 
peptide sequences shown in FIG. 21 are 334 and 348-359, 
respectively. 
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FIG. 22 shows that very distinct patterns are observed in the nucleotide sequence of a target nucleic acid with 

after hybridization of p53 DNA chips with taigets having oligonucleotide probes of defined length. The length (L) of 

different 1 base substitutions. In the first image in FIG. 22, the probe is typically expressed as the number of nucleotides 

the 12-mer probes that form perfect matches with the or bases in a single-stranded nucleic acid probe. For pur- 
wild-type target are in the first row (top). The 12-mer probes 5 POses of the present invention, lengths rMgmg from 12 to 18 

with single base mismatches are located in the second, third. bases are prefcrrwl, although shorter and longer lengths can 

and fourth rows and have much lower signals. • also be employed. To employ the- Uling method, one syn- 

• no. 23, in graphs 2. 3. and 4. graphicaUy depicts the data 'h^izcs a set of probes defined by the parUcular nucleotide 

in no. 22 On each graph, the X orfinate is the position of ?«)"'°« °f j?'""' ^"''^'^ 
n: \~ ... ,° V- ._j •»,- V 10 in the target DNAsegmenl, one synthesizes a probe comple- 

the probe in >ls row on the chip, and the Y ordmate is the « K subsequence of thl target nucleic add b<igin- 

signal at that probe Site after h>br.djzation. J^^^^ j,^^ ^ 3..^.^^ J 

FIG. 24 shows the results of hybridizing mixed target 1) 
populations of WT and mutant p53 genes to the p53 DNA a preferred embodiment of the invenUon, the probes are 

^ ' ^ ^ A I, / T7ir> ii\ IS arranged (either by immobilization, typically by covalcnt 

FIG. 25, in graphs 1^ shows (see FIG, 23 as well) the ^^^^^^^^^^^ ^f a pre-synlhesized probe or by synthesis of the 

• hybridization cfficieDcy of a lO-mer probe array as com- ^^^^^ substrate) on the substrate or chips in lanes 

pared to a 12-mer probe array stretching across the chip and separated, and these lanes are 

HG. 26 shows an image of a p53 DNA chip hybridized to ^^^^ arranged in blocks of preferably 5 lanes, although 

a target DNA. blocks of other sizes will have useful appHcation, as will be 

FIG. 27 illustrates how the actual sequence was read from apparent from the following illustration. The first of these 

the chip shown in FIG. 26. Gaps in the sequence of letters j^ncs, called the "wild-type lane", contains probes 

in the WT rows correspond to control probes or sites. arranged in order of sequence, and all of the probes are 

Positions at which bases are miscalled are represented by complementary to a specified wild-type nucleic acid 

letters in italic type in cells corresponding to probes in which ^5 sequence. The other four lanes contain probe sets for dctcct- 

theWT bases have been substituted by other bases. The SEQ jjjg ^jj possible single-base mutations in the defined 

ID. NO. corresponding to the peptide sequence shown in sequence; in turn, these probe sets are defined by a position 

FIG. 27 is 360. =^ of potential non-complementarity in the probe relative to the 

FIG. 28 illustrates the VLSIPS™ technology as applied to laiget (i.e., a single base mismatch) and the identity of the 

lhelightdirectedsynthesisofotigonucleolides.Ught(hv) is nucleotide in the probe at that position (i.e., whether the 

shone through a mask (Mj) to activate functional groups nucleotide is an A, C, G, or T nucleotide). The position of 

( — OH) on a surface by removal of a protecting group ()Q. mismatch, also called the position of substitution, is prefer- 

Nucleosidc building blocks protected with photorcmovable ; ably selected to be near the center of the probes, i.e., position 

protecting groups (T-X, C-X) are coupled to the activated .7 of a probe of L-15, 

areas. By repealing the irradiation and coupling steps, very 35 p^^ gjch probe in the wild-type lane, one synthesizes four 

complex arrays of oHgonucleolides can be prepared. probes (one for each of the lanes other than the wild-type 

FIG. 29 illustrates how the VLSIPS^ process can be used |anc), Three of these four probes is identical to the corrc- 

to prepare "nucleoside combinatorials" or oligonucleotides sponding wild -type probe but for the base at the position of 

synthesized by coupling all four nucleosides to form dimers, substitution, and the remaining probe is identical to the 

trimcrs, etc. 40 wild-type probe. This set of four substitution probes is 

FIG. 30 shows the dcprotection, coupling, and oxidation preferably placed in a column directly below (or above) the 

steps of a solid phase DNA synthesis method. corresponding wild-type probe, thus creating an A-lane, a 

FIG. 31 shows an illustrative synthesis route for the C-lane, a G-Iane, and a T-laoc. FIG. 2 shows an illustrative 

nucleoside building blocks used in the VLSIPS^^ method. tiled array of the invention with probes for the detection of 

FIG. 32 shows a preferred photoremovable protecting 45 point mutations. The base at the posiU'on of substitution in 

group, MeNPOC, and how to prepare the group in active each of the wild-type probes is shown in the wild-type lane, 

fQfjQ and the shading shows the location of the substitution probe 

HG. 33 iUustrates an illustrative detection system for having the wild-type sequence. Below are the probes that 

scanning a DNA chip. ^o"^^ ^« P^^«^^ "'."^^^^ ^ 

• ■ . ■ 50 probe length were 15 and the position of substitution werc; 

DETAILED DESCRIPTION OF THE ' 7, 

INVENTION 3'.CCGAeTGCAGTCGTT (SEQ. ID. N0:1) . 
Using the VLSIPS™ method, one can synthesize arrays 3'-CCGACTACAGTCGTr (SEQ. ID. N0:2) 
of many thousands of oUgonucleolide probes on a substrate, 3'-CCGACTCCAGTCGTr (SEQ. ID. N0:3) 
such as a glass sUde or chip. The method can be used, for 55 3'-CCGACTGCAGTCGTr (SEQ. ID. N0:1) 
instance, to synthesize "combbatorial" arrays consisting of, 3'-CCGACTrCAGTCGTr (SEQ. ID. N0:4) 
for example, all possible oclanuclcotides. Such arrays can be Thus, the substitution lanes occupy four of the five lanes 
used for primary sequeocing-by-hybridization on genomic separating successive wild-type lanes on the chip; the blocks 
DNA fragments or other nucleic acids or to detect mutations of five lanes can be separated by a sixth lane for measure- 
in a target nucleic acid for which the normal or •Vild-type" 60 ment of background signals. 

nucleotide sequence is already known. Using the preferred The DNA chips of the invention have a wide variety of 

method of the invenU'on, one employs a strategy called applications. In one embodiment, the DNA chip is used to 

"tiling" to synthesize specific sets of probes or at spatially- select an optimal probe from an array of probes. In this 

defined locations on a substrate, creating the novel probe embodiment, an array of probes of variable length and 

arrays and "DNA chips" of the invention. 65 sequences is synthesized and then hybridized to a target 

To illustrate the tiling method of the invention, consider nucleic acid of known sequence. The pattern of hybridiza- 

Ihc problem of detecting muUtions at one or more position * tion reveals the optimal length and sequence composition of 
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probes to detect a particular mutatioo or other specific basesubstimtionandaoydeletion within the 192-baseexoD, 

sequence of Ducleotides. In some circumstances, i.e., target including the three-base deletion known as AF508. As 

nucleic acids with repeated sequences or with high G/C described in detail below, hybridizaU'on of sub-nanomolar 

content, very long probes may be required for optimal concentrations of wild-type and AF508 oligonucleotide tar- 

dctcction.. In one embodiment for detecting specific s get nucleic acids labeled wth fluorescein to these arrays 

• sequences in a target nucleic acid with a DNA chip, repeat . : P^dMces highly specific signals (detected with confocal 

sequences are detected as foUows. Tbe . chip comprises- scanning fluorescence microscopy) that permit d'scnm^a- 

probes of length sufficient to extend into the Icpc.l region H"" ^'^"'^ '^TX"!!^JyZ\"XZ'^Z^lZ^^ 
*^ . j.» c t_ j'TT- 1 • * homozyeous and heterozygous cases. The melnod and chips 
varying distances from each end The sample, paor to t^e invention can ali. be used to delect other known 
hybndizalion, is treated with a labeled oligonucleotide that 10 ^^^^^^^^ ^ ^FTR gene, as described in detail below. 
IS complementary to a repeat region but shorter than the fuU ^^^^ common cystic fibrosis mutation is known as 
length of the repeat. The target nucleic is labeled with a ^^^^^ htczvsc the mulaaon is a three-base deletion that 
second, distinct label. After hybridization, the chip is ^^^^i^ removal of amino acid from the CFTR 
scanned for probes that have bound both the labeled target protein. The present invention provides DNA chips for 
and the labeled oligonucleotide probe; the presence of such is delecting AF508, one such chip results from applying the 
bound probes shows that at least two repeal sequences are tiling method to cxon 10 of the CFTR gene, the cxon to 
present. which AF508 has been mapped. The tiling method involved 
A variety of methods can be used to enhance detection of the synthesis of a set of probes of a selected length in the 
labeled targets bound to a probe on the array. In one range of from 10 to 18 bases and comp lementary to subse- 
cmbodiment, the protein MutS (from £. colt) or equivalent 20 quenccs of the known wild-type CFTR sequence starting at 
proteins such as yeast MSHl, MSH2, and MSH3; mouse a position a few bases into the intron on the 5*-side of exon 
Rep-3, and Streptococcus Hex-A, is used in conjunction 10 and ending a few bases into the intron on the 3'-side. 
with target hybridization to delect probe-target complex that There was a probe for each possible subsequence of the 
contain mbmatched base pairs. The protein, labeled directly given segment of the gene, and the probes were organized 
or indirectly, can be added to the chip during or after 25 into a "lane" in such a way that traversing the lane from the 
hybridization of target nucleic acid, and dififcreniially binds upper left-hand comer of the chip to the lower rigjithand 
to homo- and heteroduplex nucleic acid. A wide variety of comer corresponded to traversing the gene segment base- 
dyes and other labels can be used for similar purposes. For by-base from the 5*-cnd. The lane containing that set of 
instance, the dye YOYO-1 is known to bind preferentially to probes is, as noted above, called the "wild-type lane." 
nucleic acids containing sequences comprising runs of 3 or 30 Relative to the wild -type lane, a "substitution" lane, called 
more G residues. the ^*A-lanc", was synthesized on the chip. The A-lane* 

TheDNAchipsproducedby the methods. of the mventioD . probes were identical in sequence to an adjacent 

can be used to study and detect mutations in exons of human (immediately below the conresponding) wild-type probe but* 

genes of clinical interest, including point mutations and contained,regardIessof the sequence of the wild-t3rpe probe, 

deletions. In the following sections, the method of the 35 a dA residue at position 7 (counting from the 3'-end). In 

invention is illustrated by the detection of mutations in a similar fashion, substimtion lanes with replacement bases 

variety of clinically and medically significant human nucleic dC, dG, and dT were placed onto the chip in a "C-lanc,** a 

acid sequences. Thus, the invention is illustrated first with "G-lane," and a "T-lane," respectively. A sixth lane on the 

respect to the preparation of DNA chips for the detection of chip consisted of probes identical to those in the wild-type 

mutations associated with cystic fibrosis, then with DNA 40 lane but for the deletion of the base in position 7 and 

chips for the detection of human mitochondrial DNA restoration of the original probe length by addition to the 

sequences, then with DNA chips for the detection of muta- 5'-cnd the base complementary to the gene at that position, 

tions in the human p53 gene associated with cancer, and The four substitution lanes enable one to deduce the 

finally with respect to the detection of mutations in the HIV sequence of a target cxon 10 nucleic acid from the relative 

RT gene associated with drug resistance. 45 intensities with which the target hybridizes to the probes in 

Detection of Cystic Fibrosis Mutations with DNA Chips the various lanes. The probe organization on the chip can be 

A number of years ago, cystic fibrosis, the most common conveniently columnar, and the set of probes consisting of a 

severe autosomal recessive disorder in humans, was shown wild-type probe and four corresponding substitution probes 

to be associated with mutations in a gene thereafter named is refcned to as a "column set." One and only one of the four 

the Cystic Fibrosis Transmembrane Conductance Regulator 50 substitution probes in a column set has exactly the same 

(CFTR) gene. The scqucnccs'of the exons and parts of the sequence as the wild-type probe in the set. Those of skill in 

introns in the gene' are known, as are the changes cone- the art will appreciate' that, in other embodiments of- the 

spending to several hundred known mutations. Several tests invention, one could delete one or more lanes or columns 

have been developed for detecting the most frequent of these and still benefit from the invention. Various versions of such 

mutations. The present invention provides CFTR gene oli- 55 exon 10 DNA chips were made as described above with 

gonucleolide arrays (DNA chips) that can be used to identify probes 15 bases long, as well as chips with probes 10, 14, 

mutations in the CFTR gene rapidly and cfficienUy. and 18 bases long. For the results described below, the 

The methods used to make the high-density DNA chips of probes were 15 bases long, and the position of subsUtution 

the invention allow probes for long stretches of DNA coding was 7 from the 3*-cnd. 

regions to be direcUy "written" onto the chips in the form of 60 To demonstrate the ability of the chip to distinguish the 

sets of overlapping oligonucleotides. These methods have AF508 mutation from the wild-type, two synthetic target 

been used to develop a number of useful CFTR gene chips, nucleic acids were made. The first, a 39-mcr complernentaiy 

one illustrative chip bears an array of 1296 probes covering to a subsequence of exon 10 of the CFTR gene having the 

the full length of cxon 10 of the CFTR gene arranged in a three bases involved in the AF508 mutation near its center, 

36x36 array of356Xm elements. The probes in the array can 65 is called the "wild-type" or wt508 target, corresponds to 

have any length, preferably in the range of from 10 to 18 positions 111-149 of the cxon, and has the sequence shown 

residues and can be used to detect and sequence any single- below: 
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S^CATTAAAGAAAAT/GrCATCTTTGGTGTTTCCW^ 

GATGA (SEQ. ID NO: 5). 
The second, a 36-mcr probe derived from the wild-type 
target by removing those same three bases, is called the 
"mutaot" target or mu50S target and has the sequence shown 5 
below, first with dashes to indicate the deleted bases, , and 
then without dashes but with one base underlined (to indi- 
cate the base detected by the T-Iane probe, as discussed 
below): 

5*-CATTAAAGAAAATATCAT lo 

TGGTGTITCCTATGArGA; (SEQ. ID N0:6) 
S'-CATTAAAGAAAATATCATTGGTGTITCCTATGAT^ 

(SEQ. ID N0:7) 
Both targets were labeled with fluorescein at the 5'-end. 

In three separate experiments, the wild-type target, the 15 
mutant target, and an equimolar mixture of both targets was 
exposed (0.1 nM wt508, 0.1 nM mu508, and 0.1 nM vrt508 ' 
plus 0.1 dM muSOS, respectively, in a solution compatible 
with nucleic acid hybridization) to a CF chip. The hybrid- 
ization mixture was incubated overnight at room 20 
temperature, and then the chip was scanned on a reader (a 
confocal fluorescence microscope in photon-counting mode; 
images of the chip were constructed from the photon counts) 
at several successively higher temperatures while still in 
contact with the target solution. After each temperature 25 
change, the chip was allowed to equilibrate for approxi- 
mately one-half hour before being scanned. After each set of 
scans, the chip was exposed to denaturing solvent and 
conditions to wash, i.e., remove target that had bound, the 
chip so that the next experiment could be done with a clean 30 
chip. . * . ■ 

The results of the experiments are shown in FIGS. 3, 4, 5, . 
and 6; FIG. 3, in panels A, B, and C, shows an image made 
from the region of a DNA chip containing CFER exon 10 
probes; in panel A, the chip was hybridized to a wild-type 35 
target; in panel C, the chip was hybridized to a mutant delta 
508 target; and in panel B, the chip was hybridized to a 
mixture of the wild -type and mutant targets. FIG. 4, in sheets 
1-3, corresponding to panels A, B, and C of FIG. 3, shows 
graphs of fluorescence intensity versus tiling position. The 40 
labels on the horizontal axis show the bases in the wild-type 
sequence corresponding to the position of substitution in the 
respective probes. Plotted are the intensities observed from 
the features (or synthesis sites) containing wild-type probes, 
the features containing the substitution probes that bound the 45 
most target (''called"), and the feature containing the sub- 
stitution probes that bound the target with the second highest 
intensity of all the substitution probes ("2nd Highcsl**). 

These figures show that, for the wild-type target and the 
equimolar mixture of targets, the substitution probe with a 50 - 
nucleotide sequence identical to the corresponding wild- , 
type probe bound the most target, -allowing for ah imam- 
biguous assignment of target sequence as shown by letters 
near the points on the curve. The target wt508 thus hybrid- 
ized to the probes in the wild-type lane of the chip, although 55 
* the strength of the hybridization varied from probe-to-probe, 
probably due to differences in melting temperature. The 
sequence of most of the target can thus be read directly from 
the chip, by inference from the pattern of hybridization in 
the lanes of substitution probes (if the target hybridizes most 60 
intensely to the probe in the A-Iane, then one infers that the 
target has a T in the position of substitution, and so on). 

For the mutant target, the sequence could similarly be 
called on the 3'-side of the deletion. However, the intensity 
of binding declined precipitously as the point of substitution 65 
approached the site of the deletion from the 3*-cnd of the 
target^ so that the binding intensity on the wild-type probe 



whose point of substitution corresponds to the Tat the 3'-cnd 
of the deletion was very close to background. Following that 
pattern, the wild-type probe whose point of substitution 
corresponds to the naiddle base (also a 1) of the deletion 
bound still less target. However, the .probe in the T-lane of 
that column set bound the target very well. 

Examination of the sequences of the two targets reveals 
that the deletion places an A at that position when the 
sequences are aligned at their 3'-ends and that the T-lane 
probe is complementary to the mutant target with but two 
mismatches near an end (shown below in lower-case letters, 
with the position of substitution underHned): 

Target: 5*-CATTAAAGAAAATATCATTGGTGT- 
TFCCr/aGAJGA 

Probe: 3'-TagTAGTAACCACAA (SEQ. ID N0:8) 
Thus the T-lane probe in that column set calls the correct 
base from the mutant sequence. Note that, in the graph for 
the equimolar mixture of the two targets, that T-lane probe 
binds aknost as much target as does the A-lane probe in the 
same column set, whereas in the other column sets, the 
probes that do not have wild-type sequence do not bind 
target at all as well. Thus, that one column set, and in 
particular the T-lane probe within that set, detects the AF508 
mutation under conditions that simulate the homozygous 
case and also conditions that simulate the heterozygous case. 

The present invention thus provides individual probes, 
sets of probes, and arrays of probe sets on chips, in specific 
patterns, as the probes provide important benefits for detect- 
ing, the presence of specific exon 10 sequences. The 
sequences of several important probes of the invention are 
shown below. In each case, the letter "X" stands. for the point 
of substitution in a given colunin set, so each of the 
sequences actually represents four probes, with A, C, G, and 
T, respectively, taking the place of the "X." Sets of shorter 
probes derived from the sets shown below by removing up 
to five bases from the 5'-end of each probe and sets of longer 
probes made from this set by adding up to three bases from 
the exon 10 sequence to the 5'-cnd of each probe, are also 
useful and provided by the invention. 
3'-TrrATAXTAGAAACC (SEQ. ID N0:9) 
3*-TTArAGXAGAAACCA (SEQ. ID N0:10) 
3*-TArAGTXGAAACCAC (SEQ. ID N0:11) 
3'-ATAGTAXAAACCACA (SEQ. ID N0:12) 
3'-TAGTAGXAACCACAA(SEQ. ID N0:13) 
3'-AGTAGAXACCACAAA (SEQ. ID N0:14) 
3'-GTAGAAXCCACAAAG (SEQ. ID N0:15) 
3'-TAGAAAXCACAAAGG (SEQ. ID N0:16) 
3'.AGAAACXACAAAGGA (SEQ. ID NO: 1*0 

Although in this example the sequence could not be 
■reliably deduced near the ends of the target, where there is 
not enough overlap between target and probe. to allow 
effective hybridization, and around the center of the target, ' 
where hybridization was weak for some other reason, per- 
haps high AT-content, the results show the method and the 
probes of the invention can be used to detect the mutation of 
interest. The mutant target gave a pattern of hybridization 
that was very similar to that of the wtSOS target at the ends, 
where the two share a common sequence, and very different 
in the middle, where the deletion is located. As one scans the 
image from right lo left, the intensity of hybridization of the 
target to the probes in the wild-type lane drops off much 
more rapidly near the center of the image for mu508 than for 
wt508; in addition, there is one probe in the T-lane that 
hybridizes intensely with mu508 and hardly at all with 
wt508. The results from the equimolar mixture of the two 
targets, which represents the case one would encounter in 
testing a heterozygous individual for the mutation, are a 
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blend of the results for the separate targets, showing the 
power of the invention to distinguish a wild-type target 
sequence &om one containing the AF50S mutation and to 
detect a mixture of the two sequences. 
. The results above clearly demonstrate how the DNA chips 5 
of the invention can be used to detect a deletion mutation, 
AF508; another model system was lised to shovv that the 
chips can also be used to detect a point mutation as well. One 
of the more frequent mutations in the CFTR gene is G480C, 
which involves the replacement of the G in position 46 of lO 
exon 10 by a T, resulting in the substitution of a cysteine for 
the glycine normally in position #480 of the CFTR protein. 
The model target sequences included the 21-mer probe 



12 



lems. The wild-type sequence could easily be read from the 
chip, but the probe that bound the mu480 target so well when 
only the mu480 target was present also bound it well when 
both the mutant and wild-type targets were present in a 
mixture, making the hybridization pattern easily distinguish- 
iable from that of the wild-type target alone. These results 
again show the power of the DNA chips of the invention to 
detect point mutations in both homo- and heterozygous 
individuals. 

To demonstrate clinical application of the DNA chips of 
the invention, the chips were used to study and detect 
mutations in nucleic acids from genomic samples. Genomic 
samples from a individual carrying only the wild-type gene 



W1480 to represent the wild-type sequeiace at positions 
37-55 of exon 10: S'-CCrrCAGAGGGTAAAATTAAG 15 andanindividualheterozygousforAF503 were amplified by 
(SEQ. ID N0:18) and the 21-mer probe mu480 to represent PCR using exon 10 primers containing the promoter for T7 
the mutant sequence: 5'-CCTrCAGAGTGTAAAAITAAG RNA polymerase. Illustrative primers of the invention arc 
(SEQ. ID N0;19). shown below. 



Exoa Name Sequence 



10 Cri9-T7 TAATACGACrCACWTAGGGAGatgacctaataalgatgggnt 

10 Cril0c-T7 TAATACGACrCACrATAGGGAGtflgtgtgaagggttcatotgc 

10 CTilOc-'n CTXXKjAATIAAarcrCACrAAAGGtaglgtgaagggttcatatg 
10, n CTilO-T? TAATACGACrCACTArAGGGAGagcataclaaaagtgactclc 

11 CFillc-T7 TAATACGACrCACIXrAGGGAGacatgaalgacatttacagcaa 
11 Crillc-T3 CGOAyOTAACrcrCACTAAAGGacatgaatgacalttacagcaa 



(SEQ. ID. NO20) 
(SEQ. ID. NO:21) 
(SEQ. ID. NO:22) 
(SEQ. ID. NO:23) 
(SEQ. ID. N0124) 
(SEQ. ID. N025) 



In separate experiments, a DNA chip was hybridized to 30 These primers can be used to amplify exon 10 or exon 11 
each of the targets wt4S0 and mu480, respectively, and then . . sequences; in another embodiment, multiplex .PCR is 
scanned with a confpcal microscope. FIG. 5, in panels A, B, ^ . , employed, using two or more pairs of primers to amplify, 
and C, shows an image made from the region of a DNA chip more than one exon at a time. • 
containing CFTR exon 10 probes; in panel A, the chip was The product of amplification was then used as a template 
hybridized to the wt480 target; in panel C, the chip was 35 for the RNA polymerase, with fluoresceinated UTP present 
hybridized to the mu480 target; and in panel B, the chip was to label the RNA product. After sufficient RNA was made, 
hybridized to a mbcture of the wild-type and mutant targets. it was fragmented and applied to an exon 10 DNA chip for 
FIG. 6, in sheets 1-3, corresponding to panels A, B, and C 15 minutes, after which the chip was washed with hybrid- 
of FIG. 5, shows graphs of fluorescence intensity versus ization buficr and scanned with the fluorescence micro- 
tiling position. The labels on the horizontal axis show the 40 scope, A useful positive control included on many CP exon 
bases in the wild-type sequence corresponding to the posi- 10 chips is the 8-mer 3'-CGCCGCCG-5'. FIG. 7, in panels 
tion of substitution in the respective probes. Plotted arc the A and B, shows an image made from a region of a DNA chip 
intensities observed from the features (or synthesis sites) containing CFTR exon 10 probes; in panel A, the chip was 
containing wild-type probes, the features containing the hybridized to nucleic add derived from the genomic DNA of 
substitution probes that bound the most target ("called"), and 45 an individual with wild-type AF508 sequences; in panel B, 
the feature containing the substimtion probes that bound the the target nucleic acid originated from a heterozygous (with 
target with the second highest intensity of all the substitution respect to the AF508 mutation) individual. FI<3. 8, in sheets 
probes ("2nd Highcsr). 1 and 2, corresponding to panels A and B of FIG. 7, shows 

These figures show that the chip could be used to graphs of fluorescence intensity versus tiling position, 
sequence a 16-basc stretch from the center of the target 50 -These figures show that the sequence of the wild-type 
wt480 and that discrimination against mismatches is quite RNA can be called for most of the bases near the mutation. 
• good throughout the sequenced region. When the DNA chip • In the case of the AF508 heterozygous carrier; one particular 
was exposed to the target mu480, only one probe in the' probe, the same one that distinguished so cleariy between 
portion of the chip shown bound the target well: the probe the wild-type and mutant oligonucleotide targets in the 
in the set of probes devoted to identifying the base at 55 model system described above, in the T-lane binds a large 
position 46 in exon 10 and that has an A in the position of amount of RNA, while the same probe binds little RNAfrom 
substitution and so is fully complementary to the central the wild-type individual. These results show that the DNA 
portion of the mutant target All other probes in that region chips of the invention arc capable of detecting the AF508 
of the chip have at least one mismatch with the mutant target mutation in a heterozygous carrier, 
and therefore bind much less of it. In spite of that fact, the 60 Thus, the present invention provides methods for synthc- 
scqucnce of mu480 for several positions to both sides of the sizing large numbers of oligonucleotide probes on a glass 
mutation can be read from the chip, albeit with much- substrate and unique probe sets in a defined array in which 
reduced intensities from those observed with the wild-type the probes are arranged in the array by the "tiling" method 
target. of the invention. The DNA chips produced by the method 

The results also show that, when the two targets were 65 can be used to detect mutations in particular sequences of a 
mixed together and exposed to the chip, the hybridization target nucleic acid, such as genomic DNA or RNA produced 
pattern observed was a combination of the other two pal- from transcription of an amplified genomic DNA. These 
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chips can be used to detect both point znutatioDS and small 
deletions. Moreover, the pattern of hybridization to the chip 
allows inferences to be drawn about the sequences of the 
mutant DNAs. 

For example, in the model system involving the cystic 
.fibrosis point mutation G480C, the A-lane probe ■ whose 
position of substitution corresponds, to the position of the 
mutation does not bind much wild-type target, because in the 
wild-type sequence, a G occupies that position. However, it 



some applications to using a minimal set of oligonucleotides 
specific to the sequence of interest, rather than a set of all 
possible N-mers. Some of these advantages include: (i) each 
position in the array is highly informative, whether or not 
hybridization occurs; (ii) nonspecific hybridization is mini- 
mized; (iii) it is straightforward to correlate hybridization 
differences with sequence differences, particularly with ref- 
erence to the hybridization pattern of a known standard; and 
(iv) the ability to address each probe independently during 



binds mutant target very well, alloxving one to infer correctly 10 gy^thesis, using high resolution photolithography, allows the 



that the mutation involves a change of that G to a T. 
Similarly, in the case of the three-base deletion in cystic 
fibrosis known as AF508, the T-lane probe that binds mutant 
target so intensely is responding to the fact that the deletion 
has brought a CAT sequence into the position occupied by 15 
a err sequence in the wild-type target. The DNA chips of 
the invention can be used to detect and sequence not only 
known mutations in an organism's genome but also new 
mutations not previously characterized. The DNA chips and 



array to be designed and optimized for any sequence. For 
example the length of any probe can be varied independently 
of the others. 

The present invention illustrates these advantages by 
providing DNA chips and analytical methods for detecting 
specific sequences of human mitochondrial DNA. In one 
preferred embodiment, the invention provides a DNA chip 
for analyzing sequences contained in a 13 kb firagmcnt of 



methods of the invention can also be used to detect specific 20 human mitochondrial DNA from the "D-loop" region, the 



sequences in .other CETR cxons as well as other human 
genes for' purposes of research and clinical genetic analysis, 
as demonstrated below. 

Detection of Specific Human Mitochondrial DNA 
Sequences with DNA Chips 

As noted above, the present invention provides DNA 
chips on which a known DNA sequence is represented as an 
array of overlapping oligonucleotides on a solid support. 
This set of oligonucleotides is used to probe a target nucleic 



most polymorphic region of human mitochondrial DNA. 
One such chip comprises a set of 269 overlapping oligo- 
nucleotide probes of varying length in the range of 9->14 
nucleotides with varying overlaps arranged in ^00x600 
25 micron features or synthesis sites in an array 1 cmxl cm in 
size. The probes on the chip are shown in columnar form 
below. An illustrative mitochondrial DNA chip of the inven- 
tion comprises the following probes (X, Y coordinates are 



shown, followed by the sequence; "DL3" represents the 
acid comprising the known sequence, allowing mutations to 30 3 -end of the probe, which is covalently attached to the chip 
be detected. As also noted above, there are advantages in surface.) * 



0 0 DLSAGTCGGGTATTT 

1 0 DUGGGIAnTAGTr 

2 0 DLnTAGTITArcCAA 

3 0 DUATCCAAACCAGG 

4 0 DUAOCAGGATOGGA 

5 0 DL30CjIXiTCm>TGTGG 

6 0 DL30GTGTGTGTGTGGC 

7 . 0 DL3TCGTGTQTGTGTGG 

8 0 DUGTAGGATGGGTC 

9 0 DLSAGGATGGGTCXjT 

10 0 DUGArGGGTCGTGT 

11 0 DLJTCGCGAOGAITG 

12 0 DL3G0GACX>ArTGQO 

13 0 DL3TCKKjGGGGA 

14 0 DL3GAGGGGGCXJ 

15 0 DL3GGAGGGGGCXtA 

16 0 DL3GAGGGGGOGA 

0 1 DLSGGcrrcorrGG 

1 1 DUGGTrGGTITGGG 

2 1 DUTCGGCnTCTAG 

3 a DLSGTrrCtAGTGGO 

4 1 DUAGTCKKKKKjTGT. 

5 1 DUGGGGTOTCAAAT 

6 1 DLBGICAAATACArCG 

7 1 DUACArCGAATGGAG 

8 1 DUOGAATGGAGGAG 

9 1 DLSGAGGACmTCGT 

10 1 DUTiTOGrrrArGTCA 

11 1 DL3AnjTGACrmAC 

12 1 DUGACmTACAAAr 

13 1 DUAAAfCTCCOCGA 

14 1 DUAArcroOXGAG 

15 1 DL3CCCGAGTXnACT 

16 1 DUAGTGTAGTGGGG 

0 2 DUGGGAGGGTCAG 

1 2 DUGGTCAGGGTAIG 

2 2 DUGGTATGArGATTAG 

3 2 DL3GArrAGAQTAAGT 

4 2 DUTTAGAGTAAGTrA 



(5EQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 



NO:26) 

tiO'2S) 
HOJS) 
NOJO) 
NOJl) 
NO:32) 
NO:33) 
NO-.34) 

NO:36) 
NO:37) 
KO:33) 



(SEQ ID. NO J9) 
(SEQIO.KO:40) 
(SEQ ID.NO:41) 
(SEQ1D.N0:42) 
(SEQID.KO:43) 
(SEQID.NO:44) 
(SEQID.N0;4^ 
(SEQID.NO:46) 
(SEQID.Na47) 
(SEQ ID. N0:4S) 
(SEQ ID. NO:49) 
(SEQU).Na.50) 
(SEQnXKOJl) 
(SEQiD.NO-.52) 
(SEQ ID. Na53) 
(SEQ ID. NO'J4) 
(SEQID.NO:55) 
(SEQ ID. NO*5d) 
(SEQ ID. NO-J7) 
(SEQ1D.N058) 
(SEQID.NO-,59) 
(SEQID.NO:eO) 
(SEQID.NO:61) 
(SEQID.NO:62) 



9 2 DL3GGTAGGAXGGGT 

10 2 DUGGATGGGTCX3TG 

11 2 DL3GGT0GTGTGTGT 

12 2 DL3GTGTGTGTGGCG 

13 2 DL3TGTGGCOACGAr 

14 2 DLSGACGATTOGGGT 

15 2 DLSATTGGGGTArGG 

16 2 DUGTArGGGGCnXj 

0 3 DLSGGAITGTGGTCG 

1 3 DL3TGGTa3GAITGG 

2 3 DLSGGATTGCnnAAA 

3 3 DL3TCrAAAGTnAAA 

4 3 DLSGTrTAAAATAGAA 

5 3 DL3ATAGAAAAACCG 

6 3 DUAGAAAAACCGC 

7 3 DL3AACOGCCATAC 

8 3 DL3CCATACGTGAAAA 

9 3 DLSACGTGAAAATTGT 

10 3 DL3AAarrGTCAGTGGG 

11 3 DL3TGTX:AGT0GGGG 

12 3 • DUTGGGGTTGA - 

13 3 DUGGGTrGATrGTGT 

14 3 DLHTGTGTAAIAAAA 

15 3 DL3AArAAAAGGG0A 

16 3 DL3TAAAAGGOGAGG 

0 4 DL3Gill'lllAAAGG 

1 4 DUmTAAAGGTGG 

2 4 DUAGOTOGTITGO 

3 4 DL3TrGGOGGGGAG 

4 4 DL3GGAGGGGGCG 

5 4 DL3GGGGCGAAGAC 

6 4 DL3GAAGACOGGATG 

7 4 DUCCGGATCTCOTG 

8 4 DUGTOCTGAAmGT 

9 4 DL3CGTOAAnTGTQT 

10 4 DUTTGTGTAGAGACG 

11 4. DUTAGAGAOGGnr 

12 4 DUACGGmTGGGG 

13 4 DUTGGGGTmTGT 

14 4 DUGGO'iiiilUlll 



(SEQ ID. NO:67) 
(SEQ ID. NO:68) 
(SEQ ID. NO:69) 
(SEQ ID. NO:70) 
(SEQ ID. N0:71) 
(SEQ ID. NO:72) 
(SEQ ID. NO:73) 
(SEQ ID. NO:74) 
(SEQ ID. N0:7^ 
(SEQ ID. NO:76) 
(SEQ ID. N0:7^ 
(SEQ ID. NO:78) 
(SEQ ID. NO:79) 
(SEQ ID. NO:80) 
(SEQ ID, N0:81) 
(SEQ ID. NO:82) 
(SEQ ID. NO:83) 
(SEQ ID, NO:84) 
(SEQ ID. NO:85) 
(SEQ ID. NO:86) . 
(SEQ ID. NO:87) 
(SEQ ID. NO:88) . 
(SEQ ID. N0:'S9) 
(SEQ ID. n6:90) 
(SEQ ID. N0:91) 
(SEQ ID. NO:92) 
(SEQ ID. NO:93) 
(SEQ ID. NO:94) 
(SEQ ID. N05i) 
(SEQ ID. N0:9d) 
(SEQ ID. N057) 
(SEQ ID. m^S) 
(SEQ ID. N059) 
(SEQID.NO:100) 
(SEQ ID. NO:101) 
(SEQ ID. NO:102) 
(SEQ ID. NO:103) 
(SEQ ID. NO:104) 
(SEQ ID. N0:1Q5) 
(SEQ ID. NO:106) 
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DUAAGTTATGTrGOG 

DUGTTGGGGGCG 

DL3GGGGGGGGTA 

DUGCGGGTAGGAX 

DL3ACACAAITAAITAA 

DUAATTAAITACGAA 

DLSTACGAACATtXTTG 

DUACGAACATCCrGT 

DUTCXTGTArTAITA 

DUGTArTArTArrGTT 

DUATTGTIAAACrrA 

DL3AAACITACAGACG 

DLSACAGACGTGTCG 

DUGTGTCGGTGAAA 

DUGTGAAA GCnXjT GT 

DUGGTGTGTCTCTAG 

DUTGTGTCnj TAGTA 

DUGTAGTATrGTrrr 

DUAGTArrGTrmr 

DLSCCrCGTGGGATA 

DUTGGGATACAGCG 

DUGATACAGCXjTCAT 

DLSGCGTCATAGACAG 

DL3AGACAGAAACTAA 

DL3CAGAAACIAAGGA 

DL3TAAGGACGGAGT 

DL3GA0GGAGTAGGA 

DLSGTAGGATAATAAA 

DLSTAATAAATAGCG 

DL3ArAGCGTAGGAr 

DL3TAGCGTAGGArG 

DUAGGATCCAAGIT 

DLSATGCAAGTTATAA 

DLSGTTATAATGTCCG 

DLSATGTCOGCITGT 

DUTCCXjCTTOTATG 

DL3GTGAGTCCCCrc 

DLSTGCOCTCGAGAG • , 

DUCCICGAGAGGTA 

DL3AGAGGTACGTAA 

DLSACGTAAACCATA 

DUACCATAAAAGCAG 

DL3AAAGCAGACCC 

DLSAGACCCCCCAT 

DUCCCOCATACGT 

DLSCATACXjTGCGCr 

DUGTGOGCTAICAG 

DLSGCGCTATCAGTA 

DL3TCAGTAA0GCrC 

DUGTAACXiCrCTGC 

DUAGTCTATCXXCA 

DUATCCCCAGGGA 

DL3CAGGGAACTGGT 

DUACTGGTGGTAGG 

DL3CrGGTGOTAGGA 

DL3GTAGGAGGCACA 

DL3GGCACArrTAGT 

DL3nTAGTrAIAGGG 

DUAGGTITACGGTXj 

DL3TACGGTGGGGA 

DUGTGGGGAGTGG 

DL3GGGAGTGGGTGA * 

DUGGGTOATCXTArc 

DLSCCTArGGTIGrTrT 

DUGGTronrGGAIO 

DLSGnTGGAIXKiGT 

DLSATGGGTGGGAAr ' 

DLSGGGAAITGTCArG 

DUGarCAXGTArCATGT 

DL3TCArCTXnTCGG 

DUTAmCGGTAAA 

DUTTCGGTAAATGO 

DLSGTAAATGGCATGT 

DLSGCArCTAATCGTG 

DUGTAATCGTGTAAr 

DUGGGAGGGGTAC 

DUGGGTACGAATGT 

DUACXjAATOTTCGIT 

DLSTGITCGTICArCT 

DLSCGTTCArGTCGTr 



(SEO ID. NO:63) 
(SEQ ID. NO:64) 
(SEQ ID. KO:65) 
(SEQ ID. NO:66) 
(SEQ ID. KO:lll) 
(SEQ ID. Nb:112) 
(SEQ ID. NO:113) 
(SEQ ID. NO:114) 
(SEQ ID. NO:115) 
(SEQID.KO:llQ 
(SEQ ID. KO:ll^ 
(SEQ ID. NO:118J 
^EQ ID. NO:119) 
(SEQ ID. NO:120) 
(SEQ ID. KO:121) 
(SEQ ID. NO;122) 
(SEQ ID. NO:123) 
G5EQ ID. NO:124) 
(SEQ ID. NO:l25) 
C5EQ ID. NO:l26) 
(SEQ ID. NO:l27) 
(SEQ ID. NO:128) 
(SEQ ID. NO:129) 
^EQ ID. NO:130) 
(SEQ ID. NO:131) 
(SEQ ID. NO:132) 
(SEQ ID. KO:133) 
(SEQ ID. NO:134) 
(SEQ ID. NO:135) 
(SEQID.NO:136) 
(SEQ ID. NO:137) 
(SEQ ID. NO:138) 
(SEQ ID. NO:139) 
(SEQ ID. KO:140) 
(SEQ ID. NO:141) 
(SEQ ID. NO:142) 
(SEQ ID. NO:143) 
(SEQ ID. NO:144) 
(SEQID. NO:145) 
(SEQ ID. NO:146) 
(SEQ ID. NO:147) 
(SEQ ID. NO:143) 
(SEQID.NO:149) 
(SEQ ID. NO:150) 
(SEQID.NO:151) 
(SEQ ID. NO:152) 
(SEQID.NO:153) 
(SEQ ID. N0:154) 
pEQ ID. NO:355) 
(SEQ ID. NO:l56) 
(SEQ ID. NO:203) 
(SEQ ID. NO:204) 
(SEQiaNO:205) 
(SEQID.NO-.206) 
C5EQ ID. KO:207) 
(SEQ ID. NO;208) 
(SEQ ID. NO:209) 
(SEQ ID. NOllO) 
(SEQ ID. NO-.2n) 
(SEQID. NOil2) 
CSEQID.N0.213) 
(SEQ ID. NO:214) 
(SEQID.NO:215) 
(SEQ ID. NO1210 
(SEQ ID. K0:21^ 
^EQID.NO:218) 
(SEQ ID. NO:219) 
(SEQ ID. NO-.220) 
(SEQ ID. N0a21) 
(SEQ ID. NO:222) 
(SEQ ID. NO-.223) 
(SEQ ID. NO:224) 
(SEQ ID. NO-.225) 
(SEQ ID. NO-.226) 
CSEQ ID. NO-.227) 
(SEQ ID. NO:228) 
C5EQ ID. KO:229) 
CSH3 ID. NO-.230) 
(SEQ ID. NCh231) 
(SEQ ID. NO:232) 



DurrGTrrcrroGG 

DUrcTTGGGArrGTQ 
DLyrGTATGAATGAnr 
DUTGAmCACACAA 
DUCrCTGCGACCrC 
DL3GACCrCXK3CCr 
bUrCGGCCrCGTG 
DUGATGAAGTCCCAG 
DUAGTCCCAGTATIT 

DUGTArrrcGGArrr 

DL3rcGGAnTArCG 
DUGATITArOGGGT 
DUATCXjGGTGTGCA 
. DUIXiTCCAAGGGGA 
DUCAAGGGGAAnr 
DUGAAnTAITCTGTA 
DLJlC fG T A GTGCTAC 
DUGTAGTGCTACCr 
DL3GCIACCTAGTAG 
DLSCTAGTAGTOCAGA 
DL3TCCAGArA9TGGG 
DUAGArAGTOGGATA 
DUGGOATAArrGGT 
DUTAAITGGTGAGTG 
DUTATAGGGCGTGT 
DUGGGCGTGTTCICA 
DLSGIGTrCTCACGAr 
DLJrCACGArGAGAGG 
DUATGAGAGGAGCG 
DUAGGAGCGAGGC 
DL3CGAGG0CaK} 
DL3GCCCXKKjMT 
DL3CGGGTAITGTGA 
DL3GTGAACO0CCAT 

DUCcccArcGArrr 
DUArcGAmcAcrr 
DunrcAcrrGACAr 

DLyrrGACATAGAGCr 
bUTAGAGCrGTAGAC 
DUGTAGACCAAGGA 
DUACCAAGGATGAAG 
DLSCGTGIAArGrcAO 
DUTGTCAOrrrAGGG 
DUTCAGTITAGGGA 
DUTAGGGAAGAGCA 
DUAAGAGCAGGGGT 
DUCAGGGGTACCTA 
DLSGGTACCTACTCG 
DUTACTGGGGGGA 
DLSGGGGGAGTCTAT 

DUCATCTArrnrGG 

DUmTGGGTrAGG 
DLSGGGTTAG GArGT 
DUGGATCTAOTnTG 
DUTCTAGTnTGGG 
DLJITTGOGGGAGG 
DUGGGTrCATAACrG 
DUATAACTGAGTGGO 
DUAACTGAGTCGGT 
DUGTGGGTAGTTGT 
DUGTACilIGTTGGC 
DL3<nTGG0GArACA 
DL3CGArACATAXAAO 
DUTAAAAGCATGTAA 
DUGCATCTAArGACG 
DUATGACGGTCGGT 
DUGTCGGIGGTACr 
DUGGTACrrATAACA 
DUrrCGAmCTAAGAr 
DUTAAGAITAAAnT 
DUAAArnOAATAAG 
DUAATAAGAGACAAG 
DUAAGAGACAAGAAA 
DL3AAGAAAGTACCC 
DLJAAAGIAOCCCrr 
DUCCCCliCOTCTA 
DUCIiC G TCT A AAC 
DUClAAACCCArGG 
DUAACCCATGGTGG 
DUTGGTGGGITCAr 



(SEQ ID. NO:107) 
(SEQ E). NO:103) 
(SEQ ID. NO:109) 
(SEQ ID. NO:110) 
(SEQ ID. N0:15?^ 
(SEQ ED. N6:158) 
' (SEQ ID. N0:159) 
(SEQ ID. NO:160) 
(SEQ ID. N0:16l) 
(SEQID.NO:162) 
(SEQID.NO:163) 
(SEQ ID. NO:164) 
(SEQ ED. N0:165) 
(SEQ ID. N0:166) 
(SEQ ID. NO:167) 
(SEQ ID. N0:16S) 
(SEQ ID. NO:169) 
(SEQ ED. NO:170) 
(SEQ ID. N0:17l) 
(SEQ ID. NO:172) 
(SEQ ED. NO:173) 
(SEQ ID. NO:174) 
(SEQ ID. N0:17^ 
(SEQ ID. N0:17Q 
(SEQ ED. NO:177) 
(SEQ ID. NO:178) 
(SEQ ID. KO;179) 
(SEQ ID. NO:1SO) 
(SEQ ID. N0:181) 
(SEQ ID. KO:182) 
(SEQ ID. NO:183) 
(SEQ ID. NO:184) 
(SEQ ID. NO:185) 
(SEQ ID. N0:186) 
(SEQ ID. NOilS'O 
(SEQ ID. NO:188) 
(SEQ ID. NO:189) 
(SEQ ID. NO:190) • 
(SEQ ID. N0:191) ^ 
(SEQ ID. NO:192) 
(SEQ ID. NO:193) 
(SEQ ID. NO:194) 
(SEQ ID. NO:195) 
(SEQ E). N0:19^ 
(SEQ ID. N0:19^ 
(SEQ lb. NO:198) 
(SEQ ID. NO:199) 
(SEQ ID. NO:200) 
(SEQ ID. NO:201) 
(SEQ ED. KOa02) 
(SEQ ED. NO:246) 
(SEQ ID. NO:247) 
(SEQ ED. NO:248) 
(SEQ D. NO-.249) 
(SEQ ID. NO:250) 
(SEQ ID. N0i51) 
(SEQ ID. NO:252) 
(SEQ ID. NOi53) 
(SEQID,NO-254) 
(SEQ ID. N0i5^ 
(SEQ ID. NO:25a) 
(SEQ'ID.N0-.25^ 
(SEQ ID. NO:25a) 
(SEQID.NO:259) 
(SEQID.NOa60) 
(SEQ ID. N0261) 
(SEQ ID. NO-.262) 
(SEQ ID. NO:263) 
(SEQ ID. NO:264) 
(SEQ ID. NO-.265) 
(SEQ ID. NO:26«) 
(SEQ ID, NO:267) 
(SEQ ID. NO:268) 
(SEQ ID. NO:269) 
(SEQED.NOa70) 
(SEQ D. N0i7l) 
(SEQ ID. N0572) 
(SEQ ©. NOt273) 
(SEQID.NO:274) 
(SEQID.NO-.27^ 
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-coDtinued 



10 12 DUGTCGTIAGTrGG 

11 12 DUTAGTTGGGAGTT 

12 12 DUGGAGmGATAGTG 

13 12 DUATAGTGTGTAGTT 

14 -12 bLSGTFiAGtTGACGt 

15 . 12 DL3TGACXjTTGAGGT 

16 12 DL3bGTn>AGGTnA 

5 13 DUTATAACATGCCAT 

6 13 DL3AACATGCCArGGT 

7 13 DLSCCArGGTATTAr 

8 13 DUAnTATGAACTCG 

9 13 DL3AACTGGTGOACAr 

10 13 DUTGGACATCArGTA 



(SEQ ID. NOa33) 5 16 

(SEQ ID. NO:234) 6 16 

CSEO ID. NOi35) 7 16 

(SEQ ID. N0a3^ 8 16 

(SEQ ID; KO:237) 9 16 
(SEQ ID. NOi238) • 10 16 

(SEQ ID. NO:239) . 11 16 

(SEQ ID. NO:240) 12 16 

(SEQ ID. KO:241) 13 . 16 

(SEQ ID, NO:242} 14 16 

(SEQ ID. N0243) 15 16 

(SEQ ID. NOa44) 16 16 
(SEQ ID. NO:245) 



DUTTGGAAAAAGGT (SEQ ID. NO:276) 
DL3/^AAAGGT^CC^G (SEQ ID. NO:277) 
DLSGGTTCCTGTTTA (SEQ ID. NO:278) 
DL3CCIlrt U AGTCrC (SEQ ID. KO:279) 
DLTTTAGTCrCrmr . (SEQ ID. N6:280) 
DLSCrmrCAGAAAr (SEQ id. N0:281) 
DUAGAAAITGAGGTG (SEQ ID. NOa82) 
DUAAAITGAGGTGGT (SEQ ID. NO:283) 
DUGGTGGTAATCGT (SEQ ID. NO:284) 
DL3rAAnX>TGGGTr . (SEQ ID. NO:285) 
DLSGTGGGTrrCGAT ; (SEQ ID. NO:286) 
DUGGTTrCGArrcr . (SEQ id. N0:287) 



No probes were present in positions X, Y-0, 12 to X, Y-4, 
12; X, Y«0. 13 to X, Y-4, 13; X, Y-0. 14 to X. Y-4. 14; X. 
Y-0, 15 to X, Y-.4, 15; X, Y-0, 16 to X, Y-4, 16; The length 
of each of the probes on the chip was variable to minimize 
differences in melting temperature and potential for cross- 
hybridization. Each position in the sequence is represented 
by at least one probe and most positions are represented by 
2 or more probes. As noted above, the amount of overlap 
between the oligonucleotides varies from probe to probe. 
FIG, 9 shows the human mitochondrial genome; "O^" is the 



IS 



20 



and in several cases, the differences were within noise levels. 
Improvements can be realized by increasing the amount of 
0%'erlap between probes and hence overall probe density 
and, for duplex DNA targets, using a second set of probes, 
either on the same or a separate chip, corresponding to the 
second strand of the target. FIG. 14, in sheets 1 and 2, shows 
a plot of nomaalized intensities across rows 10 and 11 of the 
array and a tabulation of the mutations detected. 

FIG. 15 shows the discrimination between wild-type and 
mutant hybrids obtained with this chip. The median of the 



H strand origin of replication, and arrows indicate the cloned 25 six normaUzed hybridization scores for each probe was 
unshaded sequence. taken. The graph plots the ratio of the median score to the 

DNA was prepared from hair roots of six human donors normalized hybridization score versus mean counts. On this 
(natl to mt6) and then amplified by PGR and cloned into graph, a ratio of 1.6 and mean counts above 50 yield no false, 
M13; the resulting clones were sequenced using chain positives, and whDe it is clear that detection of some mutants 
terminators to verify that the desired specific sequences were 30 can be improved, excellent discrimination is achieved, con- 



present. DNA from the sequenced M13 clones was amplified 
by PGR, transcribed in vitro, and labeled with fluorescein - 
UTP using T3 RNApolymerase. The 1.3 kb RNA transcripts 
were fragmented and hybridized to the chip. The results 



sidering the small size of the array. FIG. 16 illustrates how 
the identity of the base mismatch may influence the ability 
to discriminate mutant and wild-type sequences more than 
the position of the mismatch within an oligonucleotide 



showed that each different individual had DNA that pro- 35 f'^^l °i^^?^^^P??^^°° ^ expressed^ as % of probe 



duced a unique hybridization fingerprint on the chip and that 
the differences in the observed patterns could be correlated 
with differences in the cloned genomic DNA sequence. The 
results also demonstrated that very long sequences of a 
target nucleic acid can be represented comprehensively as a 40 
specific set of overlapping oUgonuclcotides and that arrays 
of such probe sets can be usefully applied to genetic analy- 
sis. 



length from the 3'-end. The base change is indicated on the 
graph. These results show that the DNA chip increases the 
capacity of the standard reverse dot blot format by orders of 
magnitude, extending the power of that approach many fold 
and that the methods of the invention are more efficient and 
easier to automate than gel-based methods of nucleic acid 
sequence and mutation analysis. 

These advantages become more apparent as chips with 
more and more probes are employed. To illustrate, the 
present invention provides a DNA chip for analyzing human 



The sample nucleic acid was hybridized to the chip in a 
solution composed of 6xSSPE, 0,1% Triton-X 100 for 60 45 mitochondrial DNA (mtDNA) that* "tiles" fiirough 648 
minutes at 15** C. The chip was then scanned by confocal nucleotides of human H strand mtDNA from positions 
scanning fluorescence microscopy. The individual features 16280 to 356. The probes in the array arc 15 nucleotides in 
on the chip were 588x588 microns, but the lower left 5x5 length, and each position in the target sequence is reprc- 
square features in the array did not contain probes. To scntcdby asctof4probcs(A,C, G,TsubstituU'ons),which 
quantitatc the data, pixel counts were" measured within each 50 differed from one another at position 7 from the 3'-cnd. The, 



synthesis site. Pixels represent 50x50 microns. The fluores 
cence intensity for each feature 'was scaled to a mean 
determined from 27 bright features. After scanning, the chip 
was stripped and rehybridizcd; all six samples were hybrid- 
ized to the same chip. FIG. 10 shows the iriiage observed 55 ^ I'^' 
from the mt4 sample on the DNA chip. FIG. 11 shows the ^ 
image observed from the rat5 sample on the DNA chip. FIG. 
12 shows the predicted difference image between the mt4 

and mt5 samples on the DNA chip based on mismatches 

between the two samples and the reference sequence (see 60 sequences using prim'crs lagged 'with 13* and T7 RNA 
Anderson ct al., 1981, Nature 290: 457-465, incorporated polymerase promoter sequences and in vitro transcription to 
herein by reference). FIG. 13 shows the actual difference produce fluorcscein-UTP labeled RNA. The RNA was frag- 
image observed. mcntcd and hybridized to the oligonucleotide array in a 

The results show that, in almost all cases, mismatched solution composed of 6xSSPE, 0.1% Triton X-100 for 60 
probe/taiget hybrids resulted in lower fluorescence intensity 65 minutes at 18° C. Uohybridized material was washed away 
than perfectly matched hybrids. Nonetheless, some probes with buffer, and the chip was scanned at 25 micron pbccl 
detected mutations (or spedfic sequences) better than others, resolution. 



array consists of 13 blocks of 4x50 probes: each block scans 
through 50 nucleotides of contiguous mtDNAsequencQ.Tbe 
blocl^ are separated by blank rows. The 4 comer columns 
contain control probes; there are a total of 2600 probes in a 
1.28 cmxl.28 cm square area (feature), and each area is 
microns. 

Labeled RNA target DNA was prepared by PGR ampli- 
fication of a 1.3 kb region of human mtDNA spanning 
positions 15935 to 667, cloning into M13 (sequence verifl- 
cation was performed), and reampliflcation of the cloned 
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FIG. 17 provides a 5' to 3* sequence listing of one target 
corresponding to the probes on the chip. X is a control probe. 
Positions that differ in the target (i.e., arc mismatched with 
the probe at the designated site) are in bold. FIG. 18 shows 
the fluorescence image produced by scanning the chip when 5 
hybridized to this sample. About 95% of the sequence could , 
be read correctly from only one strand of the original duplex 
target nucleic acid. Although some probes did not provide 
excellent discrimination and some probes did not appear to 
hybridize to the target cffidently, excellent results were lO 
achieved. The target sequence differed firom the probe set at 
six positions: 4 transitions and 2 insertions. All 4 transitions 
were detected, and specific probes could readily be incor- 
porated into the anay to detect insertions or deletions. FIG. 
19 illustrates the detection of 4 transitions in the target 15 
sequence relative to the wild-type probes on the chip. 

These results illustrate that longer sequences can be read 
using the DNA chips and methods of the invention, as 
compared to conventional sequencing methods, where read- 
ing length is limited by the resolution of gel electrophoresis. 20 
Similar results were observed when genomic DNA samples 
were prepared from human hair roots. Hybridization and 
signal detection reqm're less than an hour and can be readily 
shortened by appropriate choice of buffers, temperatures, 
probes, and reagents. In principle, longer sequence reads can 25 
be obtained than by conventional sequencing, where reading 
length is limited by the resolution of gel electrophoresis. 
P53 Sequencing and Diagnostic DNA Chips 
P53 is a tumor suppressor gene that has been found to be 



between the particular mutation in p53 and the functioning 
of the resulting protein. Furthermore, there are projects 
looking at the gcrmline inheritance of p53 mutations and the 
development of cancer. The present invention provides 
useful DNA chips and methods for such studies. 

In. addition, the present invention also provides a diag- 
nostic test kit and method and p53 probes immobilized on a 
DNA chip in an organized array. Currently available diag- 
nostic tests for cancer typically have a sensitivity of about 
50%. The present invention provides significant advantages 
over such tests, and in one embodiment provides a method 
for detecting cancer-causing mutations in p53 that involves 
the steps of (1) obtaining a biopsy, which is optionally 
fractionated by cryostat sectioning to enrich tumor cells to 
about 80% of the total cell population. The DNA or RNA is 
then extracted, amplified, and analyzed with a DNA chip for 
the presence of p53 mutations correlated with malignancy. 

To illustrate the value of the DNA chips of the present 
invention in such a method, a DNA chip was synthesized by 
the VLSIPS™ method to provide an array of overlapping 
probes which represent or tile across a 60 base region of 
cxon 6 of the p53 gene. To demonstrate the ability to detect 
substitution mutations in the target, twelve different single 
substitution mutations (wild type and three different substi- 
tutions at each of three positions) were represented on the 
chip along with the wild type. Each of these mutations was 
represented by a scries of twelve 12-mer oligonucleotide 
probes, which were complementary to the wild type target 
except at the one substituted base. Each of the twelve probes 



mutated in most forms of cancer (see Levine ct al, 1991,. 30 was complementary to a different region of the target and 



Nature 351: 453-456, and Hollstein et al, 1991, 5aV/ice 
253: 49-53/ each . of which is incorporated herein by. 
reference), In addition, there is a hereditary syndrome, 
Li-Fraumeni, in which individuals inherit mutant alleles of 
p53 and tend to have cancer at relatively young ages 35 
(Frcbourg et al., 1992, PMAS 89; 6413-6417, incorporated 
herein by reference). During the development of a cancer, 
p53 Is inactivated. The course of p53 inactivation generally 
involves a mutation in one copy of p53 and is often followed 
by deletion of the other copy. After p53 is inactivated, 40 
chromosomal abnormalities begin to appear in tumors. In 
the best understood form of cancer, colorectal cancer, well 
over 50%, perhaps 80%, of all patients with tumors have p53 
mutations. In addition, p53 mutations have been found in a 
high proportion of lung, breast, and other tumors (Rodrigues 45 
ct al, 1990, PNAS 87: 7555-7559, incorporated herein by 
reference). According to data presented by David Sidransky 
(1992 San Diego Conference), over 400 mutations in p53 are 
known. 

The p53 gene spans.20 kbp in humans and has 11 cxons, 50 
10 of which are protein coding (sec Tominaga ct al., 1992, ' . 
Critical Reviews in Oncogenesis 3: 257-282, incorporated * • 
herein by reference). The gene produces a 53 kilodalton 
phosphoproteb that regulates DNA replication. The protein 
acts to halt replication at the Gl/S boundary in the cell cycle 55 
and is believed to act as a "molecular policeman," shutting 
down replication when the DNA is damaged or blocking the 
reproduction of DNA viruses (see Lane, 1992, Nature 358: 
15-16, incorporated herein by reference). There is substan- 
tial interest in the cancer research community in analyzing 60 
p53 mutations. The NCI is currently funding contracts to 
characterize the p53 mutation spectra caused by various 
carcinogens. In addition, there arc research projects which 



contained the mutated base at a different position, e.g., if the 
substitution was at base 32, the set of probes would be 
conipleraentary-with the exception of base 32 — to regions 
of the target 21-32, 22-33, and 32-43). This enabled inves- 
tigation of the effect of the substitution position within the 
probe. The alignment of some of the probes with a 12-mer 
model target nucleic acid is shown in FIG. 20. 

To demonstrate the effect of probe length, an additional 
scries of ten 10-mer probes was included for each mutation 
(see FIG. 21). In the vicinity of the substituted positions, the 
wild-type sequence was represented by every possible over- 
lapping 12-mer and 10-mcr probe. To simplify comparisons, 
the probes corresponding to each varied position were 
arranged on the chip in the rectangular regions with the 
following structure: each row of cells represents one 
substitution, with the top row representing the wild type. 
Each column contains probes complementary to the same 
region of the target, with probes complementary to the 
3'-end of the target on the left and probes complementary to 
the 5'-end of the target on the right The difference between 
two adjacent columns is a single base shiftin the positioning 
of the probes. 'Whenever possible, the series of 10-mcr 
probes were placed in four rows immediately underneath 
and aligned with the 4 rows of 12-mcr probes for the same 
mutation. 

To provide model targets, 5* fiuoresceinated 12-meri5 
containing all possible substitutions in the first position of 
codon 192 were synthesized (sec the starred position in the 
target in FIG. 20). Solutions containing 10 nM target DNA 
in 6xSSPE, 0.25% Triton X-100 were hybridized to the chip 
at room temperature for several hours. While target nucleic 
was hybridized to the chip, the fluorophorcs on the chip were 
excited by light from an argon laser, and the chip was 
scanned with an autofocusing confocal microscope. The 



involve sequencing p53 from spontaneously arising tumors, 

A major resource in these studies is the huge supply of 65 emitted signals were processed by a PC to produce an image 

biopsy material stored in parafiSn blocks. Also, there are using image analysis software. By 1 to 3 hours, the signal 

projects which are aimed at analyzing the relationship had reached a plateau; to remove the hybridized target and 
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allow hybridization to another target, the chip was stripped For sequencing, the p53 DNA can be cloned from the 
with 60% formamide, 2xSSPE at IT* C for 5 minutes. The sample or directly amplified from genomic DNAby PCR. If 
washing buffer and temperature can vary, but the buffer genomic PCR is used, then the DNA can be diluted prior to 
typically contains 2-to-3xSSPE, 10-10-60% formamide (one - amplification so that a single copy of the gene is amplified. 
. can use inultiple washes, increasing the formamide conccn- 5 For diagnostic piirposcs, the genomic DNA can be isolated 
tration by 10% each wash, and scanning between washes to • from a tumor biopsy in which the tumor cells inay be the 
' determine when the wash is complete), and optionally a majority population. As noted, aix)vc, . the proportion of. 
small percentage of Triton X-100, and the temperature is' tumor cells in a sample can be enriched by cryostat section- 
typically in the range of 15** to 18** C. ing. DNA can also be isolated and amplified from tumor 

Very distinct patterns were observed after hybridization ao samples stored in parafSn blocks, 
with targets with 1 base substitutions and visualization with The p53 DNA in the sample can be amplified by PCR 
a confocal microscope and software analysis, as shown in (although other amplification methods can be used) using 
FIG. 22. In general, the probes which form perfect matches 3-4 primer pairs generating amplicons of <3 kbp each, 
with the taiget retain the highest signal. For example, in the Illustrative primers of the invention for amplifying exon 5 of 
first image in Figure PC, the 12-mer probes that form perfect 15 the p53 gene are shown below (B is biotin; F is fluorescein), 
matches with the wild-type (WT) target are in the first row 5'-B-CACTTGTGCCCTGACTTTCAAC-3*(SEQ. ID 
(lop). The 12-mer probes with single base mismatches arc NO:288) " 

located in the second, third, and fourth rows and have much 5'-F-CACTrGTGCCCTGACmCAAC-3* 
lower signals. The data is also depicted graphically in FIG. 5 -ArGCAATTAACCCTCACTAAAGGGAGACACTTG- 
23. On each graph, the Xordinatc is thcposition of the probe 20 TGCCCTGACnTCAAC-3'(SEQ. ID NO:289) (has T3 
in its row on the chip, and the Y ordinate is the signal at that promoter) 

probe site after hybridization. 5'-B-GACCCTGGGCAACCAGCCCTGTCGT-3'(SEQ. ID 

When a target with a different one base substitution is NO;290) . ' 
hybridized the complementary set of probes has the highest 5'-F-G ACCCTGGGCAACCAGCCCrGTCGT-3' 
signal (see pictures 2, 3, and 4 in FIG. 22 and graphs 2, 3, 25 5'-TAATACGACTCACTArAGGGAGGACCCTGGGCA- 
and 4 in FIG. 23). In each case, the probe set with no ACCAGCCCTGTCGT-3'(SEQ. ID NO:291) (has T3 
mismatches with the target has the highest signals. "Within a promoter) 

12-mer probe set, the signal was highest at position 6 or 7. After PCR amplification of the target (the amplified target is 
The graphs show that the signal difference between 12-mer called the "amplicon") one strand of the amplicon can then 
probes at the same X ordinate tended to be greatest at 30 be isolated, i.e., using a biotinylated primer that allows 
positions 5 and 8 when the target and the complementary capture of the undesired strand on streptavidih beads, 
probes formed 10 base pairs and 11 base pairs, respectively. . Alternatively, asynametric PCR can be uscjd to generate a 
Because tumors often have bdlh \VT and mutant p53 genes, single-stranded target. Another approach involves the gen- 
mixed target populations were also hybridized to the chip, as " eration of single stranded RNA form the PCR product by 
shown in FIG. 24. When the hybridization solution consisted 35 incorporating a T7 or other RNA polymerase promoter in 
of a 1:1 mixture of WT 12-mer and a 12-mer with a oneof the primers. The single-stranded material can option- 
substitution in position 7 of the target, the sets of probes that ally be fragmented to generate smaller nucleic acids with 
were perfectly matched to both targets showed higher sig- less significant secondary structure than longer nucleic 
nals than the other probe sets. acids. 

The hybridization efficiency of a 10-mer probe array as 40 In one such method, fragmentation is combined with 
compared to a 12-mer probe array was also compared. The labeling. To illustrate, degenerate 8-mers or other degenerate 
10-mer and 12-mer probe arrays gave comparable signals short oligonucleotides are hybridized to the single-stranded 
(sec graphs 1^ in FIG. 23 and graphs 1-4 in tiG. 25). target material. In the next step, a DNA polymerase is added 
However, the 10-mer probe sets, which are in rows 5-S (see with the four different dideoxynucleolides, each labeled with 
images in FIG. 22), seemed to be better in this model system 45 a different fluorophore. Fluorophore-labcled dideoxynuclc- 
than the 12-mer probe sets at resolving one target firom otide are available from a variety of commercial suppliers, 
another, consistent with the expectation that one base mis- such as ABI. Hybridized 8-mers are extended by a labeled 
matches are more destabilizing for 10-mers than 12-mers. dideoxynucleotide. After an optional purification step, Le., 
Hybridization results within probe sets perfectly matched to with a size exclusion column, the labeled 9-mers are b>^rid- 
target also followed the expectation that, the more matches 50 ized to the chip. Other methods of target fragmentation can 
the individual probe formed with the target, the higher the be employed. The single-stranded DNA can be fragmented . 
signal. However, duplexes with two 3' dangles (see FIG. 23, • by partial degradation with a DNAsc or partial depurination '' 
position 6 in graphs 1-4) have about as much signal as the with acid. Labeling can be accomplished in a separate step, 
probes which are matched along their entire length (see FIG. Le., fluorophore-labcled nucleotides are incorporated before 
23, position 7, in graphs 1-4). 55 the fragmentation step or a DNA binding fluorophore, such 

This illustrative model system shows that 12-mer targets as ethidium homodimer, is attached to the target after 
that differ by one base substitutions can be readily distin- fragmentation. 

guished from one another by the novel probe array provided In one embodiment, the DNA chip has an array of 10^ to 
by the invention and that resolution of the different 12-mer 10* probes tiling across the protein coding regions of p53, 
targets was somewhat belter with the 10-mer probe sets than 60 which comprise about 1200 bp; smaller arrays specific for 
with the 12-mcr probe sets. The value of having several the 600 bp mutational hot spot region are also useful. The 
overlapping probes hybridizing to a taiget demonstrates the probes overiap for N-2 to N-4 bases, where N is the length 
value of the muldple hybridization events that take place on of the probe in bases. N is typically 10 to 14 bases long, but 
a DNA chip of the invention. The results also demonstrate as will be seen below, probes 15 to 19 bases and longer are 
the fcasibiUly of constructing a probe set to sequence the 65 also useful. Every possible single base substitution occur- 
cotire 1.4 kbp protein coding region of p53 or alternatively ring one at a time is represented in the array. The number of 
the 0.6 kbp of cxons 5-9 containing mutation hot spots. unique 10-mer probes with 7 base overlaps would be about 
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(1200/3)x4xl0 or about 1.6x10*. To allow 3 replicates of of DNA. First, the target DNA is amplified by PCR with 

each probe, one might have a total array size on the order of primers allowing easy ligation into a vector, which is taken 

4.8x10* probes. Of course, anays of probes within the «p by transformation of E. coU which in turn must be 

ranges of 10^ to 10" probes are also useful for applications; cultured. typicaUy on plates overnight. After growth of the 

for example, very l^c arrays of lO' or more probes are 5 bacteria DMA ^ purified m a procedure that typically takes 

* ^ t-t' 1 aboul2hours;theD. the seoucQcmg reactions arc pcrionncd, 

: usetui tor sequencing or wquence pneciang taige gcnotn c ^ . - j^^, ^^^tt..', b^u, ^id the samples are run on 

D.^^^K^t't^M^ ? fragmented ^f j^^f^^^^^^ .he gel for several hours, the duration depending on the 
nucleic acid hybridized to the chip is detected by a confocal ^ ^ sequenced. By contrast, the 
microscope or other imagmg device. The paUcrn of sites eient invention provides direct analysis of the PCR a cnpU- 
"lighting up" with Urget is preferably analyzed with com- 10 ^^^ material after brief -transcription and fragmentation 
puter assistance to provide the sequence of the target from ^^^^^^ ^^^g ^^y^ i^^^^ 
the pattern of sites producing signals. ^ interesting clinical application for the characterization 
The invention is illustrated below with examples of DNA heterozygous mutations with DNA chips is as foUows. 
chips comprising very large arrays of DNAprobcs to "rese- individuals with germlinc cancer mutations have a very high 
quence" p53 target nucleic acid in a sample. To analyze is ^.^^ secondary tumors after Ueatment by inadiation. 
DNA from cxon 5 of the p53 tumor suppressor gene, a set ^^^^^ ^q^^ ^^^^^^ patients may have germline muta- 
of overlapping 17-mer probes was synthesized on a chip. ^^^^j. ^^^j. suppressor genes. Thus, before 
The probes for the WT allele were synthesized so as to tUe deciding on a treatment modaHty, a physician could use the 
across the entire cxon with single base overlaps between method and DNA chips of the invention to test for a 
probes. For each WT probe, a sets of 4 additional probes, 20 j^q^ suppressor gene mutation, 
one for each possible base subsUtution at posiUon 7, were j^j^^ ^^^^^ Rational Therapeutic Management 
synthesized and placed in a column relative to the WT probe. present invention also provides DNA chips that can 
Exon 5 DNA was amplified by PCR with primers flanking physicians to determine optimum therapeutic 
the cxon. One of the primers was labeled with fluorescein; protocols by early, rapid detection of biologically mediated 
the other primer was labeled with biotin. After amplification, 25 ^g^jg^ance to a therapeutic agent in a variety of disease states, 
the biotinylatcd strand was removed by binding to strepla- benefits of such DNA chips are many, as the chips will 
yidin beads. The fluoresceinated strand was used in hybrid- ^^^^ physicians recognize health care cost savings, achieve 
iz^llon, ^ rapid therapeutic benefits, limit administration of ineffective 
. About Vi of the amplified, single-stranded nucleic aad resistance) yet toxic drugs, monitor changes in 
was hybridized overnight in 5xSSPE at 60" C to the probe 30 pathogen resistance, and deaease pathogen acquisition of 
•chip (under-a cover slip). After washing with 6>cSSPE, the resistance. Important applications include the Ueatment of 
chip was scanned using confocal microscopy. FIG, 26 shows . juv, other infectious diseases, and cancer, 
an image of the p53 chip hybridized to the Urget DNA. mV has infected a large and expanding number of people, 
Analysis of the intensity data showed that 93.5% of the 184 resulting in massive health care expenditures. HIV can 
bases of cxon 5 were called in agreement with the WT 35 ^^pidly become resistant to drugs used to treat the infection, 
sequence (see Buchman et al., 1988, Gene 70: 245-252, prf^narily due to the action of the hclerodimeric protein (51 
incorporated herein by reference). The miscalled bases were ^ jjjy ^^^^ transcriptase (RT) encoded by 
from positions where probe signal intensities were^ tied ^j^^ ^ j ^^^^ jj^^ jjjg^j ^rror rate (5-10 per round) of 
(1.6%) and where non-WT probes had the highest signal RTprolein is believed to account for the hyperaiutabilily 
intensity (4.9%). FIG. 27 illustrates how the actual sequence 40 nucleoside analogues, i.e., A2T, ddl, ddC, and 
was read. Gaps in the sequence of letters in the WT rows ^^^^ commonly used to treat HIV infection are converted to 
correspond to control probes or sites. Positions at which nucleotide analogues by sequential phosphorylation in the 
bases are miscalled are represented by letters in iulic type in cytoplasm ~ of infected cells, where incorporation of the 
cells corresponding to probes in which the WT bases have analogue into the viral DNA results in tennination of viral 
been substituted by other bases. « j^pii^ation, because the 5*-*3* phosphodiester linkage can- 
As the diagram indicates, the miscalled bases are firom the ^ completed. However, within after 6 months to 1 year 
low intensity areas of the image, which may be due to treatment. HIV typically mutates the FT gene so as to 
secondary structure in the target or probes preventing inter- become incapable of incorporating the analogue and so 
molecular hybridization. To diminish the effects due to rcsisUnt to treatment Several known mutations, are shown 
secondary strucmre, one can employ shorter targets (i.c., by 50 j^^^j^^ j.^^ 

target fragmentation) or use more stringent hybridization 
conditions. In addiUon, the use of a set of probes synthesized 
by tiling across the other strand of a duplex target can also 
provide sequence information buried in secondary structure 



the pattern of low intensity areas that fonns as a result of 
secondary structure in the target itself provides a means to 
identify that a specific target sequence is present in a sample. 
Other factors that may contribute to lower signal intensities 
include differences in probe densities and hybridization ( 
stabilities. 

These results demonstrate the advantages provided by the 

DNA chips of the invention to genetic analysis. As another ^ 

example, heterozygous mutations are currently sequenced mutaixons confer lesiiuna to other dmp u vitro 

by an arduous process involving cloning and rcpurification W _ r j. - 

of DNA. The cloning step is required, because the gel The present mvention provides DNA chips for delecting 
sequencing systems are poor at resolving even a 1:1 mixture the multiple mutations in the HIV RT gene assoaated with 
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resistance to different therapeutics. These DNA chips wiU to gain primary structure information of the DNA target, 

enable physicians to monitor mutations over Ume and to This format has important appHcations in sequencing by 

change therapeutics if resistance develops. The DNA chip hybridization, DNA diagnostics and in elucidating the ther- 

will provide redundant confirmation of conserved fflV RT modynamic parameters affecting nucleic acid recognition, 

and other gene sequences, and the probes on the chip will tile 5 ' Conventional DNA sequencing technology is a laborious 

through/with overlap, in important muUlional hot spot procedure requiring clcctrophorctic size separation of 

regions. The chip will optioiiaUy have probes that span the ■ labeled .DNA fragments. An alternative approach, termed 
entire coding region of the RT arid optionally the genes for' ^ Sequencing By Hybridization (SBH). has been proposed 

other HIVproteins, such as coat proteins. HIV target nucleic (Lysovet al., im,DokLAkad. NaukSSSR 303: 1508^1511; 

acid can be isolated from blood samples (peripheral blood 10 Bains ct al., 1988, /. Theor. Biol. 135: 303-307; and 

lymphocytes or PBMC) and amplified by PGR, primers for Draaanac ct al., 1989, Genomics 4: 114-128, incorporated 

which are shown in the table below. herein by reference). This method uses a set of short 



ampltficattok of target 

Target " ■ 

size primer 1 primer 2 



3, 742bp GTAGAATICrOTrGACrCAGATrGG OATAAGCrTOGGCCTIMCIAmx:^ 

(SEQ ID. NO:292) (SEQ ID, NO:294) ^-..^vvr-r^^ 

5a5bp AAiffCCATACAATACrCCAGTATrrGC ACCXrAItXAAAGGA^tXiAGGTICnTC 

(SEO ID. NO:293) (SEO ID. NO*.295) 

323bp Gcabaak#KD2013 1889-1908 bases 2211-2192 



The HIV RT gene chips of the invention, as well as the CF, 
mtDNA, and p53 DNA chips of the invention, illustrate the 
diverse application of the methods and probe arrays of the 
invention. The examples that follow describe methods for 
preparing nucleic acid targets from samples for application 
to the DNA chips of the invention and provide additional 
details of the methods of the invention. 

: EXAMPLES ■ 

I. VLSIPS™ Technology 

As noted above, the VLSIPS^*^ technology is described in 35 
a number of patent publications and is preferred for making 
the oligonucleotide arrays of the invention. For 
completeness, a brief description of how this technology can 
be used to make and screen DNA chips is provided in this 
Example and the accompanying Figures. In the VLSIPS 40 
method, light is shone through a mask to activate functional 
(for oligonucleotides, typically an — OH) groups protected 
with a pholocemovable protecting group on a surface of a 
solid support. After light activation, a nucleoside building 
block, itself protected with a photoreraovable protecting 45 
group (at the 5* — OH), is coupled to the activated areas of 
the support The process can be repeated, using different 
masks or mask orientations and building blocks, to prepare 
very dense arrays of many different oligonucleotide probes. 
The process is illustrated in FIG. 28; FIG. 29 illustrates how 50 
the process can be used to prepare "nucleoside cpmbinato-. . 
riak" or . oligonucleotides synthesized by Coupling all four , 
nucleosides to fomi dimers, trimers, etc. 

New methods for the combinatorial chemical synthesis of 
peptide, polycarbamate, and oligonucleotide arrays have 55 
recently been reported (see Fodor et aL, 1991, Science 251: 
767-773; Cho ct al., 1993, Science 261: 1303-1305; and 
Southern ct al, 1992, Genomics 13: 1008-10017, each of 
which is incorporated herein by reference). These arrays, or 
biological chips (see Fodor ct al., 1993, Nature 364: 60 
555-556, incorporated herein by reference), harbor specific 
chemical compounds al precise locations in a high-density, 
information rich fonnat, and are a powerful tool for the 
study of biological recognition processes. A particularly 
exciting application of the array technology is in the field of 65 
DNAsequencc analysis. The hybridization pattern of a DNA 
target to an array of shorter oligonucleotide probes b used 



oligonucleotide probes of defined sequence to search for 
complementary sequences on a longer target strand of DNA. 
The hybridization pattern is used to reconstruct the target 
DNA sequence. It is envisioned that hybridization analysis 
of large numbers of probes can be used to sequence long 
stretches of DNA. In immediate applications of this hybrid- 
ization methodology, a small number of probes can be used 
to interrogate local DNA sequence. 

The strategy of SBH can be illustrated by the following 
example. A 12-mer . target DNA sequence, 
AGCCTAGCTGAA, (SEQ. ID NO:296) is mbced with a 
complete set of octanucleotide probes. If only perfect 
complementarity is considered, five of the 65^36 octamer 
probes -TCGGATCG, CGGATCGA, GGATCGAC, 
GATCGACr, and ATCGACIT will hybridize to the target. 
Alignment of the overlapping sequences firom the hybridiz- 
ing probes reconstructs the complement of the original 
12-mcr target: 



TCGGATCG 
CX3GATCGA 
GGATOGAC 
GATOGACr 
ATCXjACrr 
TOGGATCOACrr (SEQ. ID NO:2»7} 



Hybridization cacthodblogy can be carried out by attaching ' 
target DNA to a surface: The target is interrogated wilh.a set 
of oligonucleotide probes, one at a time (sec Strezoska et al., ' 
1991, Proc Natl, Aca± Set. USA 88: 10089-10093, and 
Drmanac et al., 1993, Science 260: 1649-1652, each of 
which is incorporated herein by reference). This approach 
can be implemented with well established methods of immo- 
bilizalion and hybridization detection, but involves a large 
number of manipulations. For example, to probe a sequence 
utilizing a full set of octanucleotides, tens of thousands, of 
hybridization reactions must be performed. Alternatively, 
SBH can be carried out by attaching probes to a surface in 
an array formal where the identity of the probes at each site 
is known. The target DNA is then added to the array of 
probes. The hybridization pattern determined in a single 
experiment directly reveals the identity of all complemen- 
tary probes. 
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As noted above, a preferred method of oligonucleotide oflheprobeswUgenerate detectable sipaU Mod^^ng^^^ 
prob^ array synthesis Evolves the use of light to direct the above expression for N. one amves at a relationsbp esti- 
s^Sfte "of ^nucleotide probes in higWensity. minia- mating the number of ^electable bybnf^-.''-^^2itv C 
turized arrays. Photolabile 5'-protected N-acyl- DNA target of length Lt and an array of complexity C. 

ieoxynucleos'ide phosphoramidites. surface linker 5 °« ^VrnT.r? iT^ ^^^^ 

chemistry, and versatile combinatorial synthesis strategies . background: Nd"Cl+f(C-l»t"-g.p-j;j. • . . . 

have been developed for this technology. Matrices of • Arrays ofoUgonvcleotides caa be cEBciently generated by 
spltially defined oligonucleotide probis have been light-directed synthesis and can be used to determine the 
eenerated and the ablty to use these arrays to identify identity of DNA target sequences. Because combmatonal 
S?tn;ntary sequen^s has been demonstrated by lO strategies are used, the number of compounds ^increases 
fluore^ent labeled ohgonucleotides to the DNA exponentially whfle the number of chemical coupLng o^cles 
JK^duS'S^the methods. The hybridization pattern increases only linearly ^^Jf^^^^^'^^P^^^^^^ 
detSoLates a h|h <'egr« cf b^spccifidty and reveals -/"J ^trst ^ e^^^^^^^^^^^^^ 
'''S^^:J:^^X:SS6oU,^.cl^6ec syn. as Seles" Ucrmore. combinrorial synthesis strategies can 

thl (iT^oSed k FIG 28.^m ^r'^f''^'''''VT''""T"^^^^^^ 

support modified with photolabile protecdng groups (X) is Fo'«j'«P'«^t'«"'f,S"^^^^^^^ 

illuminated through a photolithographic mask, yielding can be produced m 48 photolysa 

KaXe hydroxyl gmuj in the iuLinated regions. A compoundsreqmrcsbxncycl«) any subset of the d^^^^^^^^ 

?.S4hoSii acdvated deoxynudeoside (protected 20 crs i^^'-^^^^^^y^'^'^'lit^^^^ 
allhe5'.hydroxylwithaphotolabnegtoup)isthenpreseated constructed wth the correct lithographic mask design m« 

SSesufSd^upliSgoccursat^site/Lwere^ose^ f''^" ^ if S^^t 

to Ught. Following capping, and oxidation, the substrate is compounds ,n an array ^J^^^°fy^y^^^^^, 

rinsed and the surface illuminated through a second mask, to synthesis sites and the overall af ray Re«nt expen 

Sse additional hydroxyl groups for'coupling. A second « menls have f«"°?!^^*^f 

S'-orotected 3'.0-phosphoramidite activated deoxynudeo- sized in 25 fm sites. At this resolut on, the enure set oi 

fidfb panted tS rSrfa«. THe selecUve photodepro- 65.536 octanucleotides «° ^- f ^^^.^/^^'L^^^^ 

taction and coupling cycles are repeated until the desired set 0.64 cm square, and the set of 1.048.576 dodecanudeotides 

- J * • rcQuircs only a 2 jo cm array. 

St dtctelScal synthesis lends itself to highly 30 lenomesequencbgprojects will ulttoately be limited 

effiS srSsSTeiies which will generate a maximum . DNA sequendng technologies. Current sequencing method- 

Sr oSoSS a miiimum Lmber of chemical ologies are Wgt'ly{'=l»°'°° ^'^P^^P^^^'^^^^^^^^^^ 
sterFor e3eTe complete set of 4n polynudeoades substantial manual effort. Sequencing by hybridiza ion has 

g'th i or^^^^^^ P«'«"«lf°5 i^LTedTri ts' S'SS 

4xn chemical steps See HG. 29. TTie patterns of illumina- 35 into more efScient and automated formats Light-directed 

firna1XorSch\m?calreac.ants'ul.imat^ ^^^f'^ ''^X SbT^^^^^ 
products and their locations. Because photolithography is 

Ssed, the process can be miniaturized to generate high- not hinited to primary ^^^""f'^f !PP^"' ""^Z 

Tns ty arrays of oUgonudeotide probes. For an example of single base changes cause 

5,e nomenchmre usfful for descn'ing such arrays, an array 40 feation P»"f ""'^^ J^^^^ 

containing aU possible octanucleotides of dA and dT is ful means to check the '^"i^y °f PJ^^^ 

written as (A+1?. Expansion of this polynomial reveals the DNAsequencc. or to scan for changes w*"" ^^""^^ 

Syofd^^6ocUDucleotidepro4f^mAAAAAAAA ^^<='>^^\<>'^T^r''^f-"^^l"^r^^n^^^^^ 

mW^^^ thf M 6l.36Lanucle- ^^^.J-^f^.^,-^^^^^ 

. aUowed to proceed for »4 to 2 hours. The surface of toe "^^^^ °' . ^^,ru 'ted ,o contain genetic markers for 

matrix is scanned in an epifluorescence microscope (Zeiss arrays can aao oe consffucica lo^ b pathoeenic 

Axioscop 20) equipped with photon counting electronics the rapid identification of a wide variety ol pattiogemc 

usine 50-100 iM of 488 nm excitation from an Argon ion 55 organisms. 

Ssp^c™ Physics model 2020). All measurements are "^.^^.^''^^^^^^^ °teS^ 

.cquiJelwiththeW.solutionin«ntactwJhthe^^ gSntcfn t'd^i^c^to^^^^^^^^^^^^ 

matrix. Photon counts are stored «,d image files a e pre- «f ""'^^.^^J'^i^^^^ s,nictu«s or to investi- 

«nted after conversion to an eight bit image fomiat. See ^ ""''^^^^^f 3° ynthetic nucleoside analogs for 

When hybridizing a DNA target to an cUgonucleotide antisensc or triple helix aPP"""""^ S^i^^^y P~'^^ 

DNA tLe t. «idZ fa the length of the oUgonudeotide ing the fliermodynam.c and faneUc rules goveramg forma- 
ppXp^Forexamp^^^^^^^^ « ^^^^i^^Sl^f^,^^, 

'^.'^.^TAt^^LTJSSZ ^IH:^ «.e nucleoside coupling^chemistry is very simflar to that 
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iised routinely today for oligonucleotide synthesis. FIG. 30 
shows the deproteclion, coupling, and oxidation steps of a 
solid phase DNA synthesis method. FIG. 31 shovys an 
illustrative synthesis route for the nucleoside building blocks 
used in the method. FIG. 32 shows a preferred photorcmov- 
able protecting grpiip, M6NP0C, and how to prepare the 
group in active form. The procedures described beloW.show 
how-to prepare these reagents. The nucleoside building 
blocks arc 5'-MeNPOC-THYMIDINE-3»-OCEP; 
5*-McNP0C-N'*-t.BUTYL PHENOXYACETYL- 
DE0XYCYTIDINE.3'-0CEP; 5'-MeNP0C-N^-l-BUTYL 
PHENOXYACETYL-DEOXYGUANOSINE-3'.OCEP; 
and 5*-McNP0C-N^-l-BUTYL PHENOXYACETYL- 
DE0XYADEN0SINE-3*-0CEP. 

A. Preparation of 4, 5-methylencdioxy-2-Ditroacct6phcnone 




10 



20 



30 



minimum volume of CHjCIj or THF(-175 ml) and then 
precipitating it by slowly adding hexane (1000 ml) while 
stirring (yield 51 g; 80% overall). It can also be rccrystal- 
lizcd (eg., toluenc-hexane), but this reduces the yield. 
C. Preparation of l-(4^- mcthylcncdioxy-2-nitrophcnyl) 
ethyl chloroformale (MeNPOC-a) 



coa2 ^ 

Tolucnc/THF'^ 





A solution of 50 g (0.305 mole) 3,4- 25 
mcthylenedioxyacetophenone (Aldrich) in 200 mL glacial 
acetic acid was added dropwisc over 30 minutes to 700 mL 
of cold (2-4** C.) 10% HNO3 with stirring (NOTE: the 
reaction will overheat without external cooling from an ice 
bath, which can be dangerous and lead to side products). At 30 
temperatures below 0" C, however, the reaction can be 
sluggish. A temperature of 3'*-5'*.C. seems to be optimal). 
The mixture was left stirring for another 60 miinutcs at 3*-5* 
C, and then allowed to approach ambient temperature. 
Analysis by TLC (25% EtOAc in hexanc) indicated com- 35 
plctc conversion of the starting material within 1-2 hr. When 
the reaction was complete, the mixture was poured into -3 
liters of crushed ice, and the resulting yellow solid was 
filtered off, washed with water and then suction-dried. Yield 
—53 g (84%), used without further purification. 40 
B. Preparation of l-(4,5-Methylenedioxy-2-nitrophenyl) 
ethanol 




NO2 



OH 



NaBH4 
EtOH 




45 



50 



' Sodium borbhydride (10 g; 0.27 mbl) was added slowly 
lo a cold, stirring suspension of 53 g (0.25 mol) of 4,5- 
metbyIenedtoxy-2-nitroacetophenone in 400 mL methanol. 
The temperature was kept below 10" C. by slow addition of 55 
the NaBH4 and external cooling with an ice bath. Stirring 
was continued at ambient temperature for another two hours, 
at which time TLC (CH^Clj) indicated complete conversion 
of the ketone. The mbcture was poured into one liter of 
ice-water and the resulting suspension was neutralized with 60 
ammonium chloride and then extracted three times with 400 
mL CH2CI2 or EtOAc (the product can be collected by 
filtration and washed at tlus point, but it is somewhat soluble 
in water and this results in a yield of only -60%). The 
combined organic extracts were washed with brine, then 65 
dried with MgS04 and evaporated. The crude product was 
purified from the main byproduct by dissolving it in a 



Phosgene (500 mLof 20% w/v in toluene from Fluka: 965 
mmole; 4 eq^ was added slowly to a cold, stirring solution 
of 50 g (237 mmole; 1 cq.) of l-(4,5-methylcnedioxy-2- 
nitrophenyQethanol in 400 mL dry THF. The solution was 
stirred overnight at ambient temperature at which point TLC 
(20% El20/hexanc) indicated >95% conversion. The mix- 
ture was evaporated (an oil-less pump with downstream 
aqueous' NaOH trap is recommended to remove the excess 
phosgene) to afford a viscous brown oil. Purification, was 
effected by flash chromatography* on a ishort (9x13 cm) 
column of silica gel eluted with 20% EijO/hexane. Typically 
55 g (85%) of the solid yellow MeNPOC-Cl is obtained by 
this procedure. The crude material has also been reccystal- 
lizcd in 2-3 crops from 1:1 cther/hexane. On this scale, -100 
ml is used for the first crop, with a few percent THF added 
to aid dissolution, and then cooling overnight at -20* C. (this 
procedure has not been optimized). The product should be 
stored dessicatcd at -20* C. 

D. Synthesis of 5*-McNPOC-2*-DEOXYNUCLEOSIDE-3'- 
(N,N-DIISO?ROPYL 2-CYANOETHYL PHOSPHORA- 

MIDITES 
(1) 5'-MeNPOC-Nucleosides 



Pyndioe ^ 




MenpocO' 




Base 



Base-THYMIDINE (T); N-4-IS0BUTYRYL 
2'-DE0XYCyTlDINE (ibu-dC); N-2.PHEN0XYACETYL 
2'DEOXYGUANOSINE (PAC-dG); and N-6- 
PHENOXYACETYL 2'DEOXYADENOSINE (PAC-dA) 

All four of the 5*-MeNP0C nucleosides were prepared 
from the base-protected 2'-deoxynucleosides by the follow- 
ing procedure. The protected 2'-deoxynuclcoside (90 
mmole) was dried by co-evaporating twice with 250 mL 
anhydrous pyridine. The nucleoside was then dissolved in 
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300 mL anhy-drous pyridine (or 1:1 pyridiae/DMF, for the 
dG"^^ nucleoside) under argon and cooled to -2** C in an 
ice bath. A solution of 24.6 g (90 mmolc) MeNPOC-Ci in 
100 mL dry THP was then added with stirring over 30 
minutes. The ice bath' was removed, and the solution allowed 5 
to stir overnight at room temperature (TLC: 5-10% MeOH 
in CHiCli; >v6 diastercomers). After evaporating the sol-^ 
vedts under vacuuno, the crude material was taken up in 250 
mL ethyl acetate and extracted with saturated aqueous 
NaHCO^ and brine. The organic phase was then dried over jq 
Na,S04. filtered and evaporated to obtain a yeUow foam. 
The crude products were finally purified by flash chroma- 
tography (9x30 cm siUca gel column elutcd with a stepped 
gradient of 2?S-6% MeOH in CH^Cl^). Vields of the pun- 
ficd diastercomeric mixtures arc in the range of 65-75%. 

f 2^ 5 '.MeNPO C-2'-DE0X YNU CLEOSID E-3*-(N,N- 
DIISOPROPYL - 2.CYAN0ETHYL 
PHOSPHORAMIDITES) 



15 



MeapocO' 




Base 



/'Amlditing rcagenr 
DIEA/DCM 



20 



McnpocO' 




Base 25 



/ 



P— OCH:CH2CN 30 



The four dcoxynucleosides were phosphitylatcd using 35 
cither 2-cyanocthyl-N.N-diisopropyl 
chlorophosphoramidile, or 2-cyanoethyl-N,N,N*,N[- 
tetraisopropylphosphorodiamidite. The foUowing is a typi- 
cal procedure. Add 16.6 g (17.4 ml; 55 ^0 s';cW PC^^^^ 

2-cyanoethyl.N.NJ^^N'.tetraisopropylphosphorodiamid^^ 40 ^Oscc^oijck^ mitochondrial DNA fragment but are 
to absolution of 50 mmole S^MeOTOC-nucleo^^^^ and 4.3 ^^^TO ^^^^^ in the absencTof a visible 

g (25 mmole) diisopropylammotnum tetrazoUde ^ band on an agarose gel, there should still be enough 



For products in the 200 to 1000 bp size range, check 2/^1 
of the reaction on a 1.5% 05xTBE agarose gel usmg an 
appropriate size standard (phiX174 cut with HacIII is 
convenient). The PGR reaction should yield several pico- 
molcs of product. It is helpful to include a ncgauve control 
(i e 1 ;il TE instead of genomic DNA) to check for possible 
contamination. To avoid contamination, keep PGR products: 
from previous experiments away from later reactions, using • 
filter tips as appropriate. Using a set of working solutions 
and storing master solutions separately is helpfu^so long as 
one does not contaminate the master stock solutions. 

For simple amplifications of short fragments from 
genomic DNA it is, in general, unnecessary to optimize 
Mg** concentrations. A good procedure is the following: 
mSce a master mix minus enzyme; dispense the genomic 
DNA samples to individual tubes or reaction wells; add 
enzyme to the master mix; and mix and dispense the master 
solution to each well, using a new filter tip each time. 

2) PURinCATION r 
Removal of unincorporated nucleotides and pnmers from 

PGR samples can be accomplished using the Promega 

Magic PGR Preps DNA purification kit. One can purify the 

whole sample, foUowing the instructions supplied with the 

kit (proceed from section IDB, 'Sample preparation for 

diredt purification from PGR reactions'). After clution of the 

PGR product in 50 //I of TE or H^O. one centrifuges the 

cluatc for 20 sec at 12,000 rpm in a microfuge and carcfuUy 

transfers 45 /il to a new microfuge tube, avoiding any visible 

pellet. Resin is sometimes carried over dunng the clulion 

step This Uansfer prevents accidental contamination of the 

linear amplification reaction with 'Magic PGR' resin. Other 

methods, e.g. size exclusion chromatography, may also be 

used. . ■ • ■ • 

3) UNEAR AMPUnCAnON . -* 

In a 0.2 mL thin-wall PGR tube mix: 4 /zl punficd PGR 
product; 2/d primer (10 pmoV//l); 4/il lOxPGR buffer 4^1 
dNTPs (2 mM dA. dC, dG, 0.1 mM dl); 4/zl 0.1 mM dUTP; 
1 wl 1 mM fluorescein dUTP (Amersham RPN 2121); 1 U 
Taq polymerase (Pcrkin Ehncr, 5 U/;^l); and add HjO to 40 
/4l Conduct 40 cycles (92« C. 30 sec, 55* C. 30 sec, 72 C. 



45 



50 



dry CHiQj under argon at ambient temperature. Continue 
stirring for 4-16 hours (reaction monitored by TLC: 
45:45:10 hexanc/GHiCli/EtaN). Wash the organic phase 
with saturated aqueous NaHGOj and brine, then dry over 
Na-SO.. and evaporate to dryness. Purify the crude amiditc 
by flash chromatography (9x25 cm sUica gel column cluted 
vrith hexane/CH^Cl,AEA-45:45:10 for A, C, T; or 0:90:10 
for G). The yield of purified amidite is. about 90%. 
H. PREPARATION, OF. LABELED DNA/ 
:HYBRIDIZAnON TO ARRAY 
1) PGR 

PGR amplification reactions arc typically conducted in a 
mixture composed of per reaction: 1 fil genomic DNA; lOjd 55 
each primer (10 pmoVjul stocks); 10//1 lOxPGR buffer (100 
mM Tris.a pH85, 500 mM KQ, 15 mM MgCl^); 10 /d 2 
mM dNTPs (made from 100 mM dNTP stocks); 2.5 U Taq 
polymerase (Pcrkin Ekner AmpbTaq*™, 5 U//il); and H^O to 
100 u\. The cycling conditions are usually 40 cycles (94 C. 
45 sec, 55' C. 30 sec, 72* C 60 sec) but may need to be 
varied considerably from sample type to sample type. These 
conditions are for 0.2 mLthin waU tubes in a Pcrkin Elmer 
9600 thcrmocyclcr. Sec Perkin Ehncr 1992/93 catalogue for 



60 



product band on an agarose gel, there should still be enou^ 
product to give an easily detectable hybridizaUon signal. If 
one is not treating the DNA with uracil DNA glycosylase 
(see Section 4), dUTP can be omitted from the reacUon. 
4) FRAGMENTATION 

Purify the linear amplification product using the Promega 
Masic PGR Preps DNA purification kit, as per Section 2 
above In a 0.2 mL thin-wall PGR tube mbc: 40 /d punfied 
labeled DNA; 4 p\ lOxPCR buffer, and 0.5 /xl uracU DNA 
olycosylasc (BRL lU//il). Incubate the mixture 15^inm at 
37* C, then 10 min at 97* C; store at -20** C. unin ready 

5rHYBRIDI2AnON SCANNING & STOIPPING 
A blank scan of the slide in hybridization buffer only is 
helpful to check that the sHde is ready for use. The buffer is 
removed from the flow ccU and replaced with 1 mL of 
(fraemenled) DNA in hybridization buffer and mixed weU. 
ThcsUn is performed in the presence of the labeled target. 
FIG 33 illustrates an illustrative detccUon system for scan- 
nine a DNA chip. A series of scans at 30 min intervals using 
a hybridization temperature of 25* C. yields a very clear 
sienaL usually in at least 30 min lo two hours, but it may be 



9600 thermocyder. See Perkin Elmer 1992^3 «Ulog^« ^ dSle to hybridi2e longer. i.c.. overnight. Using a laser 
9600 cycle time information. Target, pnmer length and «5 ^^^^^ '^^y^^ ^0 ^ pixels, one should obtain 
sequence cocposilion. among other factors, may also affect pcwer^ « ^ ^^^^ ^^^^ ,^ tousands/ 

parameters. 
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33 34 .^^^ 

pixel for a new slide. When finished, the slide can be 30 sec) are performed, but cycling conditions may need to 
stripped using 50% fonnamide. rinsing well in deionized be varied. ITiese conditions arc for 0.2 niL thin wall tubes in 
HjO, blowing dry, and storing at room temperature. Pcrkin Elmer 9600. For products in the 200 to 1000 bp size • 

III, PREPARATION OF LABELED RNA rangc,check2/dof the reaction on a 1.5% 0.5xTBE agarose 
/HYBRIDIZATION TO ARRAY 5 gel using an appropriate size standard. For larger or smaller 

1) TAiGGED PRIMERS. / volumes (20-100 //I), one can use the same amount of 

• . The primersuscd to aniplify^thc target nucleic acid should genomic DNA but adjust the other ingredienU accordingly. . 
have promoter sequences if one desires to produce RNA ' 4) IN VITRO TRANSCRIPTION .: 
from the amplified nucleic acid. Suitable promoter Mix:3//lPCRproduct;4/d5xbufifer;2;/lDTr;2.4/d 10 
sequences are shown below and include: JO mM rNTPs (100 mM solutions from Pharmacia); 0.48 fi\ 10 

(1) the T3 promoter sequence: mM fiuorescein-UTP (Fluorescein-12-UTP, 10 mM 
5'-CGGAArTAACCCTCACTAAAGG (SEQ. ID NO:29S) solution, from Boehringer Mannheim); 05 pi RNA poly- 
S'-AATTAACCCrCACTAAAGGGAG; (SEQ. ID NO:299) merase (Promega T3 or T7 RNA polymerase); and add HjO 

(2) the T7 promoter sequence: to 20/il. Incubate at ST C. for 3 h. Check 2;d of the reaction 
5' TAATACGACrCACrATAGGGAG; (SEQ. ID NO;300) 15 on a 1S% 0.5xTBE agarose gel using a size standard, 
and (3) the SP6 promoter sequence: • 5xbuffer is 200 mM Tiis pH 75, 30 mM MgCl^, 10 mM 
5' ATTTAGGTOACACTATAGAA. (SEQ. ID N0:301) - spermidine, 50 mM NaCl, and 100 mM DTT (supplied with 
The desired promoter sequence is added to the 5' end of the enzyme). The PCR product needs no purification and can be 
PCR primer. It is convenient to add a different promoter to added directly to the transcription mixture. A20 //I reaction 
each primer of a PCR primer pair so that cither strand may 20 is suggested for an initial test experiment and hybridization; 
be transcribed from a single PCR product. a 100 reaction is considered "preparative" scale (the 

Synthesize PCR primers so as to leave the DMT group on. reaction can be scaled up to obtain more target). The amount 
DMTK)n purification is unnecessary for PCR but appears to of PCR product to add is variable; typically a PCR reaction 
be important for transcription. Add 25 /^l 05M NaOH to will yield several pioomoles of DNA. If the PCR reaction 
collection vial prior to coUeclion of oligonucleotide to keep 25 does not produce that much target, then one should increase 
the DMT group on. Deprotccl using standard chemistry— the amount of DNA added to the transcription reaction (as, 
55* C overnight is convenient. weU as optimize the PCR). The ratio of fiuorescein-UTP to 

HPLC purification is accomplished by drying down the UTP suggested above is 1:5, but ratios from 1:3 to 1:10— all 
oHgonucleotides, rcsuspending in 1 mLO.lM TEAA(daute work weU. One can also label with biotin-in? and detect 
2.0M stock in deionized water, filter through 0.2 micron 30 with streptavidin-HTC to obtain similar results as with 
filter) and filter through 0.2 micron filter. Load 0.5 mL on fiuorescein-UTP detection.. - . cnr 

reverse phase HPLC (column can be a Hamilton PRP-1 • For. nondenaturing agarose gel electrophoresis of RNA, 
semi-prep, #79426). The gradient is 0^50% CH3CN over note that the RNA band wiU normally migrate somewhat -. 
25 min O^ogram 0.2 ;miol.prep.Q-50, 25 min). Pool the faster than the DNA template band, although sometimes the 
desired fractions, dry down, resuspend in 200/4 80% HAc. 35 two bands will comigrate. The temperature of the gel can 
30 min RT. Add 200 /d ElOH; dry down. Resuspend in 200 effect the migration of the RNA band. The RNA produced 
u\ njO, plus 20 u\ NaAc pH55, 600 ^ EtOH. Leave 10 min from in vitro transcription is quite stable and can be stored 
on ice; ccntrifiige 12,000 rpm for 10 min in microfiigc. Pour for months (at least) at -20« C without any ci^dcncc of 
offsupematanLRinscpcUelwithlmLEtOH.dry,resuspend degradation. It can be stored m unstenlizcd 6xSSPE 0.1% 
in 200/^1 H20. Dry, resuspend in 200 pi TE. Measure A260, 40 triton X- 100 at -20*» C. for days (at least) and reused twice 
prepare a 10 pmoV/il solution in TE (10 mM Tris.Q pH 8.0, (at least) for hybridization, without taking any special pre- 
0.1 mM EDTA). FoUowing HPLC purification of a 42 mer, cautions in preparaUon or durmg use. RNase contaminaUon 
a yield in the vicinity of 15 nmol from a 0.2 //mol scale should of course be avoided. When cxtractmg RNA from 
synthesis is typical. cells, it is preferable to work very rapidly and to use strongly 

2) GENOMIC DNA PREPARATION « denaturing conditions. Avoid using glassware previously 
For obUining genomic DNA from human hair, one can contaminated with RNases. Use of new disposable plas- 

cxtract as few as 5 hairs, including hair roots. On a clean and ticwarc (not necessarily sterilized) is preferred, as new 
sterile surface, one places the hair on a piece of parafilm, and plastic tubes, tips, etc., are cssentiaUy RNasc free. Treatment 
after wiping a new razor blade with EiOH cutting off the with DEPC or auloclaving is typicaUy not unnecessary, 
roots, the roots are transferred to a 15 . mLtnicrofiige tube 50 5) FRAGMENTATION , ^ „ , • 

using a pair of Millipore forceps cleaned with EtOH. Add * . In a 0.2 mL thin-wall PCR tube mix: 18 RNA 
500;il (10 mM Tris.a pH8.0, 10 mM EDTA, I06 mM NaQ, . from transcription reaction-^io purification require^; 18 /a. 
2% (w/v) SDS, 40 mM DTT, filter sterilized) to the sample. ' H^O; and 4^1 IM Tris.a pH9.0. Inoibatc at 99.9 C. for 60 
Add 1.25 «l 20 mg^ml proteinase K (Boehringer) Incubate at min. Add to 1 mL hybridization buffer and store at -20 C. 
55* C. for 2 hours, vortexing once or twice. Perform 2x0.5 55 until ready to use. The alkaUne hydrolysis step is very 
mL 1:1 phenokCHCL extractions. After each extraction, rehable. The hydrolysed target can be stored at -20 C. m 
centrifuge 12,000 rpm 5 min in a roicroftige and recover 0.4 6>cSSPEA).l% Triton X-lOO for at leaslscveral days pnor to 
mL supcmaunt. Add 35 NaAc pH5.2 plus 1 mL BOH. use and can also be w^sed. ^rnn^n 
Place iample on ice 45 min; then antrifiige 12,000 ipm 30 6) HYBRIDIZAnON SCANNING, & STOPPING ^ 
min, rinse, air dry 30 min, and resuspend in 100 //I TE. 60 A blank scan of the slide in hybndizaUon buffer o^y is 

3) PCR helpftil to check that the slide is ready for use. The buffer IS 

PCR is performed in a mixture containing, per reacUoo: 1 removed from the flow cell and rq)laced with^ 1 mL of 
Ad genomic DNA; 4//1 each primer (10 pmoV/il slocks); 4/^1 (hydrolysed) RNA in hybridization buffer and mixed wcU. 
10 xPCR buffer (100 mM Tris.a pH85, 500 mM KQ, 15 Incubate for 15-30 mm at 18* C Remove the hybndizaUon 
mM MgCL); 4 ;d 2 mM dNTPs (made from 100 mM dNTP 65 solution, which can be saved for suteequenl cxpcnmcnte. 
stocks); lUTaq polymerase (Pcrkin Elmer, 5 U//d);HaOto Rinse the flow cell 4-5 tunes with frj^chan^^^ 
AO Ml About 40 cVcles (94" C. 30 sec, 55* C 30 sec, 72* C 0.1% Triton X-100, cquflibratcd to 18 C The nnscs can be 
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performed rapidly, but it is important to empty the flow ce 
before each new rinse and to mix the hquid m the cell 
thoroughly/The scan is performed in the presence of the 
labeled target. A series of scans at 30 min intervals using a 
hybridization temperMure of 75^ C yields a very clear 
signal, usually in at least 30 min to two hours, but it may be 
desirable to hybridize longer, i.e., overnight, Usmg a laser 
power of 50 /iW and 50 pxsi pixels, one should obtain 
maximum counts in the range of hundreds to low thousands/ 
pixel for a new slide. When finished, the slide can be 
stripped using 50% to 100% formamide at 50° C. for 30 mm. 
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rinsing well in deionized HjO, blowing dry, and storing at 
room temperature. - . 

These conditions are illustrative and assume a probe 
length of -15 nucleotides. The stripping conditions sug- 
gested are fairly severe, but some signal may remain on the 
sHde if the washing is not stringent. Nevertheless, the counts 
remaining after the wash should be very low in comparison 
to the signal in presence of target RNA. In some cases, much 
genUer stripping conditions are efiEective. The lower the 
hybridization temperature and the longer the duration of 
hybridization, the more difBcult it is to strip the sHde. Longer 
targets may be more dif&cult to strip than shorter targets. 



( 1 ) GENERAL INFORMAnON: . . . , . 

( ! i I ) NUMBER OF SEQUENCES: 360 



SEQUENCE LISTINO 



( 2 ) INFORMATION FOR SEQ m N0:1: 

( i ) SEQUENCE CHARACraWSnCS: 
( A ) LENCFH: 15 base pain 
( B ) TYPE: nudcic »dd 
( C ) STRANDEDNESS: aa^c 
( D )TOPOU0GV: linear 

( 1 I ) MOLECULE TYPE: DNAOp»T>e) 

( 3c I ) SEQUENCE DESCRIPTION: SEQ ffi N0:1: 

TTGCTGACGT CAGCC 



1 5 



( 2 ) INFORMATION FOR SEQ IDNO-J: . _ 

( I ) SEQUENCE CHARACTERISrnCS: 
( A ) LENGTH: 15 base pahs 
( B ) TYPE: oodek acid 
( C ) STRANDEDNESS: dn^e 
( D ) TOPOLOGY: lioear 

( J i ) MOLECULE TYPE: W*A (probe) 

( X I ) SEQUENCE DESCRIPnON: SEQ ID N0:2: 

TTGCTGACAT CAGCC 



I 5 



( 2 ) INFORMATION FOR SEQ ID NO-J: 

( I ) SEQUENCE CHARACTERIsnCS: 
( A ) LENCnt IS base paiis 
( B ) TYPE: undeic acid 
( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

( X ! ) SEQUENCE DESOUmON: SEQ ID N0:3: 

TTOCTGACCT CAGCC 



1 5 



( 2 ) INFORMATION FOR SEQ ID N0:4: 

( i ) SEQUENCE CHARACIERlSnCS: 
( A ) LENGTH: 15 base pahs 
( B)TYPE:iiBdeicadd 
( C ) STRANDEDNESS: »iB^e 
( D)TOPOL0GY:lbctr 

( i i ) MOLECULE TYPE: DNAOpcobe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
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* ■ ■ ■ . » . ' * ' * 



TTGCTGACTT CAGCC - 



( 2 VlNFORMAnONFORSEQIDNOtS: ;:\ " ^ • ; \ ' ^ " 

( i )SEQUENCECHAIUClERrSTia: . ' , 3: ■ ' . '' - / . - 

< A ) LENGTH: 39 base pairs . ^ . , - . 

( B )TYPEnndeic add "V V. - - 

( C ) STRANDEDNESS: single - 

( D )TOPOLOGY; linear _ ^ • ' 

( I I ) MOLECULE TYPE- DNAColigomicleotidc) 

( X I ) SEQUENCE DESCRIPTION: SEO ID N0:5: ; . . 

CATTAAAGAA AATATCATCT TTGGTGTTTC CTATGATGA 3 9 



( 2 ybn^RMAnONFORSTOrot^ -J VV-; ; " r v: - \ -^v:r: 

( 1 ) SEQUENCE GHARACTERISndS: 

( A)tENGTH:36basepalra • - 

( B ) TVPE: ondeic acid 

< C ) CTRANDEDNESS: single 

{ D )T0P0LOGY: linear 

( I ! ) MOLECULE TIfFEiDNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

CATTAAAGAA AATATCATTG GTGTTTCCTA TGATGA 



( 2 ) INFORMATION FOR SEQ ID NO:7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pain 
( B )TWE:EucleicBcid 
( C ) STRANDEDNESS: single : . 
( D)TOPOLOGY: linear 

( I ! ) MOLECULE TlfPEiDNA (genomic) 

( X I ) SEQUENCE DESCRIFHON: SEQ ID NO:7: 

CATTAAAGAA AATATCATTG GTGTTTCCTA TGATGA 



( 2 ) INFORMATION FOR SEQ ID Nafc 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENCriH: 15 base pair* 
( B ) TYPE: nocleic add 
( C ) STRANDEDNESS: sbgle 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNAOprobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:S: 

AACACCAATG ATCAT 



( 2 ) INFORMAnON FOR SEQ ID NOA 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: IS base pairs 
( B) TYPE: DDddc add 
( C ) SntANDEDNESS: tingle 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

CCAAAGATNA TATTT 



( 2 ) INFORMAnON FOR SBQ ID NO:10: 



3 6 
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( i ) SEQUENCE CHARACTERISTICS: . 

( A ) LENGTH: 15 base pain . 
( B ) TYPE: nucleic acid 
- ( C ) SrniANDEpNESS: «ingle . 
( D ) TOPOLOGY: linear 

' . { i i )MOLECUUETYPE: DNA(pobc) 

( X i ) SEQUENCE DESCRIPTION: SEQ CD NOrlO. 

ACCAA^GANG ■ ATATT ' 

( 2 ) INFORMAnON FOR SEQ ID NO:ll: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: IS base pairs 

( B ) TYPE: nucleic acid . _ 

" v 1 7 V ^ = ( C ) STTRANDEDNESS: single ■ : : 

( D ) TOPOLOGY: IbieaT 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

CACCAAAGNT GATAT 



( 2 ) INFORMAnON FOR SEQ ID N0:12: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pain 
( B ) TYPE* nucleic acid 
( C ) OTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(11 ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:12: 

ACACCAAANA TGATA 



( 2 ) INFORMAnON FOR SEQ ID N0:13: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic add 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 
( X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:13: 
AACACCAANG ATGAT 



( 2 ) INFORMAnON FOR SEQ ID N0:14: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRimON: SEQ ID N0:14: 

AAACACCANA GATGA 



( 2 ) INFORMAnON FOR SEQ ID N0:15: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 15 base pairs 
( B ) TVPE: nndeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( I i ) MOLECULE TYPE: DNA (probe) 
. ( X i ) SEQUENCE DESCRinriON: SEQ ID N0:15: . • . /, / - 

GAAACACCNA/ AGATC J;;:!: \ : : . : ■ ' . V; ; - ' . . . 



( 2 ) INFORMAnON FOR SEQ CD N0:16: 

-...;Vi,\v v:i[ jj SEQUENCE CHAIUCTERISrriCS: - * .* - /y-C - ;-/"v^- '^-.''-t : - - ^- " . ; • i^^r ^V' ^ - 

( A ) LENGTH: 15 base pairs . " . ' 

~ • i^ v . ( B ) TYPE: nucleic add ^ ; . 

( C ) STRANDEDNESS: shgle . - ' 
( D )TOPOLOGY: linear ■ : . 

( i i )MOLECULE TYPE: DNA (probe) 

-( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:16: -> . - . v^ lj" . : y . - r r^r c . ~ ' r ' 

GGAAACACNA AAGAT - 15 



( 2 ) INFORMAnON FOR SEQ ID N0:17: 



( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pain 
( B ) TYPE: nucleic acid 
( C ) SrmANDEDNESS; »m^e 
< D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRimON: SEQ ID N0:17: 

AGOAAACANC AAAOA „ 15 



( 2 ) INFORMAnON FOR SEQ ID N0:18: . ' . .. . 

( I ) SEQUENCE CHARACTERISTICS: — • ■ 
( A ) LENGTH: 21 baw pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOP0LOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:18: 

CCTTCAGAGG GTAAAATTAA G 21 



( 2 ) tNFORMAnON FOR SEQ ID N0:19: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LEKGTH: 21 base pairs 
( B )TYPE: noclelc ocld 
( C ) STRAKDEDNESS: single 
( D )TOPOLOGY: linear 

(II ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:19: 

CCTTCAOAGT GTAAAATTAA G 21 



( 2 ) INFORMAnON FOR SEQ ID NO:20: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 44 base pairs 
( B )TYPE:nodeic add 
( C ) STRANDEDNESS: smgle 
( D ) TOPOLOGY: linear 



( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



rAATACOACT CACTATAGCG AGATGACCTA ATAATGATCG < 

• J . ■' ■■ . > 

( 2 ) mFORMAnON FOR SEQ ID N021: , . \ ' ; ' ' 

> ( i )SEQiJENCECHAiuci^isrncs: . - ■ - " " ; ■ 

. - / (A)LENGTH:43bascpajr5 . . . ,. 

^ ( B ) TYPE: npdeic acid . , - 

( C ) STTRANDEDNESS: sin^e 
: . V ( D)TpPOLOGTftiincar/: v^/ v ■ - ~ 

( i i )MOLECULETYPE:DXA(iffobe) 

( X i ) SEQUENCE DESCRIFnON: SEO lb NO:21: 

TAATACGACT CACTATAGGG AGTAGTGTGA AGGGTTCATA 



( 2 ) INFORMAnON FOR SEQ ID N0-J2: \- . 

i ) SEQUENCE CHARACTERISnnCS: 
( A ) LENGTH: 45 base paiis 
( B ) TYPE: nndeic acid 
( C ) STTRANDEDNESS: singjc 
( D )TOPOLOGY: linear 

( I ! ) MOLECULE TYPE: DKA([»obc) 

( X i ) SEQUENCE DESCRmTON: SEQ ID NO:22: 

CTCGGAATTA ACCCTCACTA AAGGTACTGT GAAGGGTTCA 

( 2 ) [NFORMAnON FOR SEQ ID NO:23: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 base pain 
( B ) TYPE: nodcic acid 

( C )SrrRANDEDNESS:»!naIe .'..-„, ! . . ' V- 

( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:23: 
TAATACGACT CACTATAGGG AGAGCATACT AAAAGTGACT 



( 2 ) INFORMATION FOR SEQ ID NOi24: 

( i ) SEQUENCE CHARACIERISTICS: 
( A)lJENCTH:44ba$epai« 
( B )TYPE: nodetc acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOCYi linear 

( I I ) MOLECULE TYPE: DNA (pcobe) 
( X ! ) SEQUENCE DESCRIPnON: SEQ ID NO:24: 
TAATACGACT CACTATAGGG AGACATGAAT GACATTTACA 



( 2 ) INFORMAnON FOR SEQ ID NO:25: 

( 1 ) SEQUENCE CHARACTERISnCS: 
( A ) LENCrnt 44 base 
( B ) TYPE: nndeic acid 
( C ) STRANOEDNESS: ato^e 
( D)TOPOLOGlft linear 

( J I ) MOLECULE TYPE: DNA (pobe) 

( X I ) SEQUENCE DESCRintON: SEQ ED NO:25: 

CGOAATTAAC CCTCACTAAA GGACATGAAT GACATT 



( 2 ) INFORMAnON FOR SEQ ID NOi26: 



45 
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( i ) SEQUENCE CHARACTERISnCS: - • 
( A ) LENGTH: 12 base pairs 
( B )TYPE: Dodck acid 
" ~ ( C )STRANDEDNESS: single . *. " 
( D ) TOPOLOGY: linear ^ / 

( i i ) MOLECUIETVPE: DNA(pobc) . . 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



TTTATGGGGT OA V 



1 2 



( 2 ) INFORMATION FOR SEQ ID NOl27: 

( I )SEQUENCECHARACrERISTlCS: 
( A ) LENGTH: Ubascpaira 
• ( B ) TYPE: ii«cte!c add ^ ^ . 
. ' - C )Srni/^EbNESS:»ittgIc . ; " 

( D)TOPOL0CY: linear : 

( i i ) MOLECULE T5fPE:DNA(F0be) . 

( X i ) SEQUENCE DESCRtPllON: SEQ ID NO:27: 

TTGATTTATG CG 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO-^ 

( i ) SEQUENCE CKARACIBRISnCS: 
( A } LENGTH: U base pairs 
< B )TYPE:BtideIc»cid 
( C ) STRANDEDNESS: single 
( D )TOP0L0G1^ linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCWrnCW: SEQ n>N0:2S: 

AACCTATTTG ATT 



1 3 



( 2 ) INFORMAnON FOR SEQ ID NO*^: 

( i ) SEQUENCE CHARACIERISnCS: 
< A )LENGrH: 12 base purs 
( D )TYP&nDclckacid 
( C ) SntANDEDNESS: smgle 
( D)TOFOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GGACCAAACC TA 



1 2 



( 2 ) INFORMAnON FOR SEQ n>KO-JQ: 

( I ) SEQUENCE CHARACIERISnCS: 
( A )LENGIH: 12 base pairs 
( B)TYFE:noddcactd 
( C ) STRANDEDNESS: smgie 
( D ) TOPOLOGY: Uncar 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUe<CE D£5CRirn(H«: SEQ ID NO-JO: 

AGGCTAGGAC OA 



1 2 



( 2 ) INFORMAnON FOR SBQ ID NO-Jl: 

( 1 ) SEOUENCZ CHARACTERISnCS: 
( A ) LENGTH: 13 base purs 
( B ) TYPE: nndek add 
( C ) SntANDEDNESS: single 
( D )TOPOLOGY^ Dnear 
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( 1 i ) MOLECULE TYPE: DNA(iprobe) 
( X i ) SEQUENCE DESCRimON: SEQ ID n6:31: 
GGTGTGTGTG TGC 



( 2 ) INFORM/OTON FOR SEQ ID N0-J2: 

' ' : ( 1 ) SEQUENCE CHARACTERISTICS: ' ' 
( A ) LENGnt U base pairs 
( B ) TYPE: Docleic add 
, ( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Uaear 

( i i ) MOLECULE TYPE: DXA(piobe) . 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:32: Y 

CGGTGTGTGT OTGC 



( 2 ) INFORMAnON FOR SEQ ID NO-33: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCnt 14 base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i i )MOUECULETYPE:DNA(iprobc) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NOa3: 

GGTGTGTGTG TGCT 



( 2 ) INFORMAnON FOR SEQ ID N0-J4: „ .1 ^ 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nodek ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNAOprobc) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

CTOGGTAOOA TO 



( 2 ) INFORMAnON FOR SEQ ID NO:35: 

( I ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 12 base pairs 
( B ) TYPE: nndeic acid 
( C ) STRANDEDNESS: single 
( D )T0P0LOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFTION: SEQ ID N0:35: 

TGCTOOGTAG GA 



( 2 ) INFORMAnON FOR SEQ ID NO:36: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENC7TH: 12 base pairs 
( B ) TYPE: DDdeic acid 
( C ) STRANDEDNESS: single 
< D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( X 1 ) SEQUENCE DESCRIPTION: SEQ ED NO:36: 
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TGTGCTGGGT AG 



( 2 ) [NFORMATION FOR SEQ to NO:37: ' - 

(J ) SEQUENCE CHARACTERI^CS: ' ; . 

. '.( A )I_ENGTH: 12 base pairs' ■ , %. - . v 

( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single ~ 
. , „ . D ) TOPOLOGY: linear , , . ; ^ . . V . 

( I I ) MOLECUlJE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEO ro NO:37: 

GTTAGCAGCGGT ... ^j. 



( 2 ) INFORlilAnON FOR SEQ ID NO:38: ' " • ~ 

. . ■ ( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 17 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )T0?OLOOY: linear - 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

GGGTTAGCAG CG . 12 



( 2 ) INFORMATION FOR SEQ ID NO:39: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pain 
( B ) TYPE: nucleic acid 
.:....:.,( C ) STRANDEDNESS: single 
• . D )TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO J9: 

AGCGGGGGAG G 11 



( 2 ) INFORMATION FOR SEQ ID N0:40: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: lObasc pain 
( B ) TYPE: nocielc acid 
( C ) STRANDEDNESS: sfagle 
( D )TOPOLOGY: Ifaiear 

( I i ) MOLECULE nrPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

AGCGGGGGAG 10 



( 2 ) INFORMAnON FOR SEQ ID N0:41: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pair* 
( B ) TVPE: nucleic acid 
( C ) STRANDEDNESS: shigle 
( D )T0P0LOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ DO N0:41: 

CGTTOCTTCC C . 11 



< 2 ) INFORMAnON FOR SEQ ID NO:42: 
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( I ) SEQUENCE CHARACIERISmCS: 

. , ( A )LENC?IH: 12 base pairs > 
^ B ) TYPE: nwlcic Bcid . • 

■ ' ' ( C ) STRANDEDNESS: single ' 

. ( D ) TOPOLOGY: linear 

■ / ;( i i) MOLECULE TYPE: DNA(piobc) _ 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:42: 

GGGTTTG(3TT ' GO V ' " ' :^ " 7 ' ^ 

( 2 ) INFORMAnONFORSEQroNO:43: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) lENCnt 12 base pain 

: . ( B )TYP!E: nucleic acid _ . 

: (C)STRAND^ ' . 

( D )TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA^Kobe) 

( X i ) SEQUENCE UESCRIFnON: SEQ CD NO:43: 

GATCTTTGGG GT 

( 2 ) INFORMAnON FOR SEQ ID NO:44: 

( i ) SEQUENCE OWRACIERISnCS; 

{ A ) LENGTH: 12 base pairs 
( B )TYPE: nndek »cH 
( C ) STRANDEDNESS: »!nglc 
( D JtOPOLOGK linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

GGGTGATCTT TG 

( 2 ) INFORMAnON FOR SEQ ID NO:45: 

( i ) SEQUENCE CKARACTERISnCS: 
( A ) LEKGIHr 12 base pairs 
( B)TYPE:niiclelc»cid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY, linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:45: 

TGTGGGGGGT GA 

( 2 ) INFORMATION FOR SEQ n>KO:46: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12base pain 
( B )TlfPE:iiiideie«cid 
( C ) STRANDEDNESS: smgle 
( D) TOPOLOGY: Uoear 

( I I ) MOLECULE TYPE DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

TAAACTGTGG GG 

( 2 ) INFORMAHON FOR SEQ © N0:*7: 

( I ) SEQUENCE CHARACTERBTICS: 
( A ) LENGTH: U base pairs 
( B )TYPfcnndcie»cld 
( C ) STRANDEDNESS: single 
< D )T0P0L0CTt linear 
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, ( i i ) MOLECULE ryR&DNA (probe) - 

( X i ) SEQUENCE DESCRIFnONi SEQ ID NO:47: . ■ ' ; . j ^ ' • ; - - • . " ' ^ . . ' , * 

GCTACATAAA ■■CTO ■■ ' - ■ y: . . ■ 1 3 

( 2 ) INFORMAnON FOR SEQ ID NO:48: . . " 

• : ■( I ) SEQUENCE CHARACinilSTICS: v"^/ / ^'^ " " ^ ^ '^ " ' 'J ' 

( A ) l£NCrrH: 13 base pairs ■ 

( B )TirPE:in>deicBcId , . . 

( C ) STRANDEDNESS: single \ 
( D )TOPOLOGy:Uiiear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

(x 1 ) SEQUENCE DESCRimON: SEQ ID NO:48: ' ' / ' "v/, . : ' •': % ;.V > ^ . 

GAGGTAAGCT ACA . : _ / , „ .. .. M 

( 2 ) INFORMAnON FOR SEQ n) NO:49: 

( I ) SEQUENCE CHARACreaiSnCS: * • . . 

( A )LENGIH: 12ba»epali$ 
( B ) TYPE: UDcIclc acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

GAOGAGGTAA GC 12 

( 2 ) INFORMAnONFORSEQlbNO-Jth - . . „. : . * . 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPB Dodeic odd 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( 1 I )MOl£CULETVrE: DNA (probe) 

( X i ) SEQUENCE DESCRIITION: SEQ ID NO:50: 

TGCTTTGAGG AO 12 



( 2 ) INFORMAnON FOR SEQ ID N0:S1: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B )TYFE:nod<icDcId 
( C ) SIKANDEDNESS: single 
( D )T0roLOGY:UDear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCBIPTTON: SEQ ID NO:51: 

AGTGTATTGC TTT 1 3 



( 2 ) INFORMAnONFOR SEQIDNO-^ 

( I ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 13 pairs 
( B )TYFE:midektdd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 



( X i ) SEQUENCE DESOOFnON: SEQ ID NO'J2: 



:attttcagt gta 

( 2 ) informanon for seq id no:53: 

( I )SEQIIENCE CHARACTERISTICS: 
~ , ," ( A ) LENGTH: 13 base pairs 

( B ) TYPE: nndeic acid 
( C ) STRANDEDNESS: single 
- \. . ( D ) TOPOLOGY: linear . . 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

TAAACATTTT CAG 



( 2 ) INFORMAnONF6RSEQ IDNO:54: " - - 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( ! I ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:54: 
AGCCCGTCTA AA 



( 2 ) INFORMATION FOR SEQ ID NO:55: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 12 base pairs . 
' ' ( B ) TYPE: nucleic acid 

( C ) STRANDEDNESS: single 

( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPnON: SEQ ID NO:55: 



GAGCCCGTCT AA 



( 2 ) INFORMATION FOR SEQ ID NO:56: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE: niictejc acid 
( C ) SriRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION; SEQ ID NO:56: 

TGATOTGAGC CC 



( 2 ) tNFORMAnON FOR SEQ ID NO:57: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X J ) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

GCGCTGATGT GA 



( 2 ) INFORMAnON FOR SEQ ID NO:58: 
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. ( i i ) MOLECULE TYPE: DXA(probc) 

( X i )SEQUENCE"DESCRIFnON:SEQIDNb:63: 
GGGTTGTATT GAA 



( 2 ) INFORMAnON FOR SEQ n> NO:64: 

( I ) SEQUENCE CHARACTEREnCS: ^ ^ 
■ . ■ ( A )lJ2NGrrH:10basepain 
( B ) TYPE: nndele add 
( C ) STRANDEDNESS: wag^c 
( d JtOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

GCGGGGGTTG 10 



( 2 ) INFORMAnON FOR SEQ ID NO:6S: 

( I ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 10 base pairs 
< B )T^rPE;nacleic»cid 
( C ) STTRANDEDNESS: shj^e 
( D ) TOPOLOGY: Uuear 

( 1 I )MOLECUUETyPE: DNA(prob«) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:6S: 

ATOGOCGGGC 



( 2 ) INFORMAnON FOR SEQ ID NO:66: 

( I ) SEQUENCE CHARACTCRIsnCS: 
( A ) LENGTH: 11 base paks 
( B )TYP£:iiQclek»d(l 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

TAGGATGGGC G 11 



( 2 ) INFORMAnON FOR SEQ ID NO:67: 

< I ) SEQUENCE CHARACTERISTICS: 
( A) LENGTH: 12 base pain 
( B ) TyPE: nuclek •cid 
( C ) STRANDEDNESS: single 
( D )TOPOLOCY:lioew 

( ! I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

TGGGTAGOAT GG 



( 2 ) INFORMAnON FOR SEQ ID N0:ti8: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pahs 
( B ) TYPE: Dodek acid 
( C ) STRANDEDNESS: sfaigte 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (pobe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
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GTGCTGGGTA GO 



1 2 



( 2 ) INFORM/mON FOR SEQ ID NO:69: .. . : - ./ 

( I ) SEQUENCE CHARACTERIfnCS: " ' \ . 
... ' , ( A ) LENGTH: 12 base pairs V ; / 

( B ) TYPE: nocldc add 
( C ) SniANDEDNESS: $fai£|e 
- ^ , , i^} TOPOLOGy: linear . / - , 

(II ) MOLECULE TYPE: DNA (probe) ' - . 

' ( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

TGTGTGTGCT GG 



-J- * - 



1 2 



( 2 ) INFORMATION FOR SEQ ID NOiTth • 

(I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nvdciti acid 
< C ) STRANDEDNESS: single 
( D )T0P0LOGY:Unear 

( I 1 )MOLECULE'nrrE:DNA (probe) 

, ( X I ) SEQUENCE DESCRIPTTON: SEQ © NO:70: 

GCOGTOTGTG TG 



1 2 



( 2 ) INFORMATION FOR SEQ ID NO:71: 

( I ) SEQUENCE CHARACTERISnCS: 
< A ) lENGIH: I2base pairs 
( B )TYP£: nucleic acid 
-( C )STRANDEDNESS: single - 
( D )T0POLOOY: linear . 

( i 1 ) MOLECULE TYPE: DNA(ixobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID KO:71: 

TAGCAGCGGT GT 



1 2 



( 2 ) INFORMATION FOR SEQ ID NO:72: 

( I ) SEQUENCE CHARACIERISnCS: 

< A ) LENGTH: 12 base pairs 

< B) TYPE: Boclelc acid 

( C ) STRANDEDNESS: single 
( D)TOPOLOGY:UoeaT 

( I i )MOLECULETYPE:DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:7Z- 

TGGGGTTAGC AG 



1 2 



( 2 ) INFORMAnON FOR SEQ CD NO:73: 

( I ) SEQUENCE CHARACIERISTTCS: 
( A ) LENCriK: 12 base pairs 
( B ) TYPE: hdcIcIc acid 
< C ) STRANDEDNESS: sbgle 
( D )T0P0LOG^ linear 

( 1 I ) MOLECULE TYPE: DNA(ivobe) 

( X 1 ) SEQUENCE DESCRIFnON: SEQ ID NO:73: 

CGTATGGGGT TA 



1 2 



( 2 ) [NFORMAnON FOR SEQ ID NO:74: 
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( I ) SEQUENCE CHARACTERISTICS: . , ' 
- ( A ) LENGTH: 12 base pairs - , 

( B ) TYPE: nndcic acid .• " ■ 

. ( C ) STRANDEDNESS: single ./ . 

- < D )TOPOLOGY: linear . ^ 

. ( i i )M01£CULETYPE: DNA(probe) , . . r . . / 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:74: ^ 
GTTCOGGOTA TG. . ■ ; -v,:;' ^ ^ , . 12 

( 2 ) INFORMAnON FOR SEQ ID NO:75: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 

_ ., ( B ) TYPE: nndeic acid ....l . . - 

. , C)SrRANDEDNESS: single \ ' ' ; . " ' ' ' 

( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPnON; SEQ ED NO:75: . 

GCTGGTGTTA GG 12 

( 2 ) [NFORMAnON FOR SEQ ID NO:76: 

(I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH; 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)TOP0LOCY: linear 

( i i )MOLECULE TYPE: DNA (probe) . 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:76: . . 

GGTTAGGCTGOT 12 

( 2 ) INFORMAnON FOR SEQ ID NO:77: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGHTH: 13 base pairs 
( B ) TYPE: noclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 1 )MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

AAATCTCGTT AGO 13 



( 2 ) INFORMAnON FOR SEQ ID NO:78: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIFHON: SEQ ID NO:78: 

AAATTTGAAA TCT 



( 2 ) INFORMAnON FOR SEQ ID NO:79: 

( I ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 13 base pairs 
( B )TYPE: DQcIeicacid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: lincsr 
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• ( i i ) MOLECULE TYPE: DNA (probe) • . . 
( X i ) SEQUENCE DESCRDTIpN: SEQ ID NO:79: 
A AG AT AAA AT TTG . :7 "v. 



1 3 



( 2 ) INFORMATION FOR SEQ ID NO:80: . . . ; ' 

^' - - ( I ) SEQUENCE CHARACiERISTICS: '^!^;:^^V 

( A ) LENGTH: 12 base pairs . 
/( B ) TYPE: DDcleIc acid :: : : 

' ( C ) SniANDEDNESS: sin^e 
( D ) TOPOLOGY: linear 

. ( i i ) MOLECULE TYPE- DNA (probe) 

' . ( x ,i ) SEQUENCE DESCRIPTIpN: SEQ ID NOiisO: 

GCCAAAAAGA TA 



1 2 



( 2 ) INFORMAnON FOR SEQ ID N0:8l: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B )TYPE:nacIeIc acid 
( C ) STRANDEDNESS: single 
< D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:81: 

CCCCAAAAAG A 



1 1 



. ( 2 ) INFORMAnON FOR SEQ ID NO:82: ' 

( i ) SEQUENCE CHARACTER[SnCS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

CATACCGCCA A 



1 1 



( 2 ) INFORMAnON FOR SEQ ID NO:83: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
< B )TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X J ) SEQUENCE DESCRIPTION: SEQ ID KO:83: 

AAAAGTGCAT ACC 



1-3 



( 2 ) INFORMAnON FOR SEQ CD KO:84: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: ncdeie acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID KO-^: 
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TGTTAAAAGT GCA 



1 3 



( 2 ) INFORMAnON FOR SEQ ID MOtSS: - " ■ . . . 

( i ) SEQUENCE CHARACIERKnCS: . 
; , ; .( A ) LENCmt U base pairs 

' ( B )TYPE:n»dek:acid : 
( C ) STRAKDEKfESS: sfagle 
. - ; : ( D )TOP01pGV: linear; > v.,: V., .: 

( i i ) MOLECULE TYFai»iA(F0t)c) 

( X I ) SEQUENCE DESCBffTO)N: SEQ ID NO:85: 

GGGTGACTGT TAA 



1 3 



( 2 ) INFORMAnON FOR SEQ ID K0t«6j 

( i ) SEQUENCE CHARACIERISTICS: 
( A ) LENGTIH U1»se pairs 
( B )TYP£aKkk>cid 
( C ) mANDEDNESS: shgle 
( D ) TOPOLOGY Imcar 



( I 1 ) MOLECULE TYPE: I»U (probe) 

( X I ) SEQUENCE DESaomON: SEQ IDNO:86: 



GGGGGTGACT GT 



1 2 



( 2 ) INFORMATION FOR SEQ ID KOfc87: 

( J ) SEQUENCE CHARACIERBTICS: 
( A ) UEKGIH: U W pairs 
( B )1TPE:BKldc«dd 
( C ) CTRANDEDNESS: single : ^ 
( D )T0P0L0G1&tiBear . 

( 1 I ) MOLECULE TYm 0>fA<irobc) 

( X I ) SEQUENCE DESOOFnON: SEQ ID NO:87: 

AGTTGGGGOG T 



1 1 



( 2 ) INFORMATION FOR SEQ ID KO:88: 

( 1 ) SEQUENCE CHABACIERISrnCS: 
( A ) LENGnt Objsc pairs 
( B ) TYPE: vadde Kid 
( C ) STRANDEDNESS: single 
( D )TOPOL0G1CUBear 



( I I > MOLECULE TYPE DHA (probe) 
( X i ) SEQUENCE DESatmiaN: SEQ ID NO:88: 
TGTGTTAGTT GGG 



1 3 



( 2 ) INFORMATION FOR SEQ ID WOt89: 

( 1 ) SEQUENCE CHARACIERISTICS: 
< A ) UEKGIft Ubasepaiis 
( B ) TYPE: Biddc add 
( C ) STRANDEDNESS: single 
( D )TOFOL0G1&liDear 

( I I ) MOLECULE TYFE:I»lA(Fbbe) 

( X I ) SEQUENCE DESdOniDN: SEQ ID NO:89: 

AAAATAATGT GTT 



1 3 



( 2 ) INFORMAHON FOR SEQ ID VOSO: 
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( I ) SEQUENCE CHARACreRrsnCS: 

( A ) LEKCIH: 12 base pairs . 
. ( B ) TYPE: noclclc acid : 
; ■ : ( C )STRANDED^fESS: single/ 
( D )TOPOLOGY: linear 

. - r.( i i )MOLECULE TYPE: DNA (probe) = \; . 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:90: 

A G G G G A A A A T A A * ■ ' - -H " — ? ' T : 



1 2 



( 2 ) INFORMATION FOR SEQ ID N0:9l: 

( i ) SEQUENCE CHARACTERISrnCS: 
( A ) LENGTH: 12 base pairs 

^ . . ( B ) TYPE: nucleic ecJd , . . 

■ " " " ' " : ■ ' ( C ) STOANDEDNESS: single 

( D ) TOPOLOGY: linear 



( i i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRHTION: SEQ ED N0:?1: 
GGAGGGGAAA AT 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:92: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: Qocleic add 
( C ) STRANDEDNESS: single 
( D )T0P0LOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

GGAAATTTTT TG 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:93: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
< B ) TYPE' nncleic acid 
( C ) CTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPnON: SEQ ID NO:93: 

GGTGGAAATT TT 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:94: 

. ( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nodeic acid 
( C ) SIRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

GGTTTCGTGG A 



1 1 



( 2 ) INFORMAnON FOR SEQ ID N0:9S: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B )TYP£:nndeicadd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID Np:95: 



GAGGGOGGGT T 



1 1 



( 2 ) INFORMAnON FOR SEQ ED NO:96: 

■ ■ A ( 1 ) SEQUENCE CHARACTERISTICS: " 
( A ) LENGTH: 10 base pairs 
. ( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sin^c 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRinioN: SEQ b NO:96: 

GCGGGOGAGG 



1 0 



( 2 ) INFORMAnON FOR SEQ ID NO:97: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:97: 

CAGAAGCGGG G 



1 1 



( 2 ) INFORMAnON FOR SEQ ID NO:98: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO;98: 

GTAGGCCAGA AG 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:99: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) SIKANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TVPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:99: 

GTGCTGTAGO CC 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:l00e 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID KO:100: 



73 



5,837,832 



-continued 



74 



TGTTTAAGTG CTG 



1 3 



( 2 ) INFORMAnON FOR SEQ ID NO:101: 

( i ) SEQUENCE CHARACIBRISnCS: 
• ( A ) LEKGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
. ( ^ ) TOPOLOGY: linear ... . , . . 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPnON: SEQ ID N0:i01: 

TGTGTTTAAG TGC 



1 3 



( 2 ) INFORMAnON FOR SEQ ID NOIOT: " ' f : 

( i ) SEQUENCE CHARACTERrsnCS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( K i ) SEQUENCE DESCRIPTION: SEQ ID NO:102: 

GCAGAGATGT GTT 



1 3 



( 2 ) INFORMATION FOR SEQ ID NO:103: 

< I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paJra 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear — 

( ! ! ) MOLECULE TYPE: DNA (probe) 

( )c i ) SEQUENCE DESCRIPTION: SEQ ID NO:103: 

TTTGGCAGAG AT 



1 2 



( 2 ) INFORMATION FOR SEQ ID NO:l(M: 

( I ) SEQUENCE CHARACTCRISnCS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TVTE: DNA (probe) 

( jc i ) SEQUENCE DESCRIPTION: SEQ ID N0:1M: 

GGGGTTTGGC A 



1 1 



( 2 ) INFORMATION FOR SEQ ID NO:105: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID Na.IOS: 

TCTTTTTGGO OT 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:L06: 



, 5,837,832 • 

-continued ' ^ - 



. ■ ( I )SEQUENCE CHARACTERISTICS: • 

■ ' ■ ' ( A ) LENOnt 12 ba« pairs ■ . , 

- ( B )'mE:nodeicBcid ■ ... . \ : ■ ^ V ' 

; ( C ) STRANDEDNESS: single . . , , ,/ 

" ( b ) TOPOLOGY linear ] : 

( i i )M01JECULETYP£DNA(piobc) ■ ' . ■' ' ^' . ■ - ^- - *' 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:106: — 

.TTT'gTTTTTO'- GG ^r:v^>'r uvr >^ '^-^ ^ .- .: ' . 12 

( 2 ) INFORMAnON FOR SEQ ID NOilOT: 

( i ) SEQUENCE CHARACTERISTICS: ^ 
( A ) LENCrlH: 12 base pairs 
( B ) TYPE: nodeic acid 

"■ ■ . ( cJsTRANDEDNESS: single ':: : " ' - ; - -v ' ^V- -:- ^r. , - - -. ■ - 

( D )TOPOLOGV. linear 



( i i )MOLECUlETyPE:DNA (probe) 
( X i ) SEQUENCE DESCRimON: SEQ ID NO:107: 
GGGTTCTTTG TT "12 

( 2 ) INFORMAnON FOR SEQ ID NaiOS: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 13 base pairs 
( B )TYPE:nndeicBcid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGXrmcaT 

( i i )MOLECULETyPE:DNA (probe) 

(x i ) SEQUENCE DESCRHTION: SEQ n> NOrlOS: 

GTGTTAOGGT TCT 13 

( 2 ) INFORMAnON FOR SEQ ID NOilOS: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nudcic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOG% linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIFTiON: SEQ ID N0:1(»: 

TTTAGTAAGT ATGT 14 

( 2 ) INFORMAnON FOR SEQ ID NOOia 

( i ) SEQUENCE CHARACTERISrnCS: 
( A ) LENGTH: U base pairs 
( B )TVPE:niideic8ctd 
( C ) STRANDEIWESS: single 
( D ) TOPOLOGY: linear 

( I ! ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQliENCE DESCRIPTIQN: SEQ ID NO:110: 

AACACACTTT AGT 13 

( 2 ) INFORMAnON FOR SEQ ID NOdll: 

( i ) SEQUENCE CHARACIERIsnCS: 
( A ) LENGTH: 14 base paiis 
( D ) TYPE: nodek add 
( C ) STRANDEDNESS: single 
( D )T0POL0G1Ciiacar 
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( i I ) MOLECULE TlfPE: DNA(ptobe) . 
( X i ) SEQUENCE XfESCRrrnON: ^ ID Naill: 
AATTAATTAA CACA ' : / 



( 2 ) INFORMATION FOR SEO ID NOrUi 

■ . ( i ■) SEQUENCE CHARACTERISTICS: - -■ - y^f^--:^--- 

( A ) LENGTH: 13 basepain . = ' - 

^ '■■-■( B)TYPE:wideJcacid " ; ' ''\ , 

■ (cisTRANDEDNESSishigle ...... : 

(D)TOFOLOGY:lmeai 

( I i ) MOLECULE TYF&DNA (probe) 

' ~ ( Vi )SEQUENCEnESCRiiaTaNiSEQ ibN0:ii2:^'." ~>T:;~:~ '77 ;:^: ' ; - . 

AAGCATTAAT TAA . : ^ 



( 2 ) INFORMATION FOR SEQ ID N0:113: 

( 1 ) SEQUENCE CHARACmmiCS: 
( A)LENGlH:13bas«FaIn 
( B)TYFE:nocIeteacUl 
< C ) STRANDEDNESS: sioffe 
( D)TOroLOCY:Uiicar 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:lt3: 

GTCCTACAAG CAT 



( 2 ) INFORMAnON FOR SEQ ID N0:U4: . • 

( I ) SEQUENCE CHARACTERISTICS: 
( A)LENCrni: 13 base pain 
( B)TYFE:BQcteIcBdd 
( C ) STRANDEDNESS: ala^le 
( D)T0POLOGY:Uiicij 

( i i ) MOLECULE TYPE: DXA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:1 14: 

TCTCCTACAA OCA 



( 2 ) INFORMAnON FOR SEQ ID N0:115: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 13 base pain 
( B)TYF£:nBc)ekKid 
( C ) SntANDEDNESS: th^c 
( D)TOFOLOGY: linear 

( I I ) MOLECULE TYPE* DNA (probe) 

( X 1 ) SEQUENCE DESCRDTION: SEQ ID NOillS: 

ATTATTATOT COT 



( 2 )INFORMAnONPOR^n>NO:U6: 

( 1 ) SEQUENCE CHARACIERISnC& 
( A ) LENGTH: 14 base pain 
( B) TYPE: Bodete add 
( C ) SrntANDEI»«ESS: utgte 
( D)TOPOLOGY:tiBeir 

( i I ) MOLECULE TYPE: DXA (probe) 

( X I ) SEQUENCE DESCRIFnON: SEQ XD N0:116: 
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TTGTTATTAT TATG , ■• . . 14 



( 2 ) INPORMATION FOR SEQ ID N0:H7; 

( i ) SEQUENCE OIARACTERISTICS: : 
i 1 ( A ) lENCrnt U base pairs 

( B )TYPE:nndeicadd 
( C ) SrRA^a>EDKESS: single 
- • ; ( D ) TpPOLOGY: linear . \ 



( i i ) MOLECULE TYPE: DNA(F0be) 
( X i ) SEQUENCE BESCRimOri: SEQ ID N0:U7: 
ATTCAAATTG TTA 13 

( 2 )mFORM«ioNFORSEQroNau8: " ■ J : - : ; ' . 

( I ) SEQUENCE CHARACTERISTICS: , . 

< A)LEN<7IH:13basepaira i 
( B )TYFE: nodcic acid 

{ C ) SntANDEDNESS: single 
( D)TOFOLOC^:tineai 

( I 1 ) MOLECULE TYFE:DXA (probe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ID N0:118: 

GCAGACATTC AAA 13 

( 2 ) INFORMATION FOR SEQ ID NO:tt9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LEKOTH: 12 base pain 
( B )TYF£: nucleic «cu) 
( C )STOANDEDNESS: single 

< D )T0FOLOGY:tiaear 

( 1 i ) MOLECULE TYPE: DXA (probe) 
( X ! ) SEQUENCE DESCRIFnON: SEQ ID N0:ll9: 
CCTGTGCAGA CA 12 

( 2 ) INFORMATION FOR SEQ ID N0:12Q: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTIH: 12 base pairs 

< B )TrFE: nucleic acid 

< C ) ^niAKDEDNESS: single 
( D )TOFOLOCY:Unear 

( I I ) MOLECULE TYFErDNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:120: 

AAAGTGGCTG TG 12 

( 2 ) INFORMAnON FOR SEQ ID NO:121: 

< i ) SEQUENCE CHARACIHRtSnCS: 
( A ) lENGIH: U base pain 
( B )TYF£: sDddc acid 
( C ) STRANDEDNESS: sta^e 
( D )T0FOLOCYi linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFTtON: SEQ ID N0:12t: 

TGTOTGGAAA CTC 13 



( 2 )INFORMAnONFORSEQIDNai22: 
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( I ) SEQUENCE CHARACTERISTICS: 
. ( A ) LENGTH: 13 bwc pairs 
. ( B )TYPE: nodclc Bcid 
J ' ( C ) STRANDEDNESS: »higlc 
. ^ ( D )TOPOLOGY: linear ■ 

( J 1 )MOLECUl£ TYPE: DNA (probe) . 

{ X i ) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

-GAT ore TO TO TCQ ^l^-^-y •y.'y^-y- 'i^^.,- -rnr V---^ C:*^;-; \ ^ 1 3;-^; 

( 2 ) INFORMAnON FOR SEQ [D NO:123: . - 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
. ' ( B ) TYPE: noclcic acid ~ ' . 

c )CTRANDEDNESS: single ^ ■ . 

, ^ ■ ; ( D ) TOP0LOGY: linear ■ - . . V ; ; ^ . , 

^ ( 1 i )MOLECULE TYPE: DNA (probe) . " • - ' . 

( X i ) SEQUENCE DESCRIPTION: SEQ n> NO:123: 

ATGATGTCTG TGT 13 



( 2 ) tNFORMAnON FOR SEQ ID NO:124: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nudeic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

(II ) MOLECULE TYPE: DNA (probe) 

. ( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:124: 

TTTTGTTATG ATG . . . 13 



( 2 ) INFORMAnON FOR SEQ ID N0:125: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

TTTTTTCTTA TGA 13 



( 2 ) [NFORMAnON FOR SEQ ID NO:126: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: noctelc ftcid 
( C ) mAKDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID Nai26: 

ATAGOGTGCT CC 12 



( 2 ) INFORMAnON FOR SEQ ID NO:127: 

( i ) SEQUENCE CHARACIHRlSnCS: 
( A ) LENCTTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: stogie 
( D ) TOPOLOGY: linear 
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■ ( i i ) MOLECULE TYPE: UNA (iptoic) 

( X i ) SEQUENCE DESOOrilON: SEQ ID NO:127: 
GCGACATAGG GT - ~' 



1 2 



( 2 ) INFORMAnON FOR SEQ ID N0:12& . 

^'^^^ ■ i ) SEQUENCE CHARACIERtSTICSt ' ^ 

, ( A)LENCntl5bascpaira 
>/( B ) TYPE: noefcSc add ^ 
( C ) STRANDEI»iESS: sfe^c 
( D )TOPOLOG% linear 

( i i ) MOLECULE TYPE- UNA Gk*c) 

; ( X i ) SEQUENCE DESaOFnON: SEQ ID NO:128: 

TACTGCGACA TAG 



1 3 



( 2 ) INFORMAnON FOR SEQ ID NOtaaSr 

( I ) SEQUENCE CHARACIERIsnCS: 
( A )L£KGntUt>3sepatn 
( B )TYFE:ii«dcie*c{a 
( C ) STRANTODNESS; sm^e 
( D ) TOPOLOGY Uoear 

( i ) ) MOLECULE TYPE: KXA (probe) 

( X i ) SEQUENCE DESCRimVI: SSQ ID NO:129: 

GACAGATACT GCG 



1 3 



< 2 ) INFORMAnON FOR SEQ ID NOtUOb 

( i ) SEQUENCE CHARACIERISTICS: 
( A )LEKani:U base pairs 
( B )TYFE:iiDcidcadd 
( C ) STRAKraCNESS: sbgte 
( D )TOPOLOa£lineu 



( 1 i ) MOLECULE TYPE DX\(iprob«) 
( X I ) SEQUENCE DESCRHHON: SEQ ID NO:130: 
AATCAAAGAC AGA 



1 3 



( 2 ) INFORMAnON FOR SEQ ID NOtUk 

( I ) SEQUENCE CHARACIERKnCS: 
( A ) lEKOTH: U base pabs 
< B )TYPE:nKldcK{d 
( C ) STRASDEBKESSi shfte 
( D )TOPOLOG%&Bcsr 



( 1 1 ) MOLECULE TYPE DMA ((robe) 
( X i ) SEQUENCE DESOOmO^ SEQ ID N0:131: 
AGOAATCAAA OAC 



1 3 



( 2 ) INFORMAnON FOR SEQ ID N0eU2 

( 1 ) SEQUENCE CHARACIERISnCS: 
( A )L£NGTU:12bisepQlis 
( B ) TYPB BKSoe »c{d 
< C ) STRANDbUNi^SS: t&^c 
( D ) TOPOLOGY Boear 

( i i > MOLECULE TYPE: UNA (pobc) 



( X i ) SEQUENCE DESCRirnON: SEQ ID NO:132: 
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TGAGGCAGGA AT - 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NOJ33: 

. (i) SEQUENCE CHARACTCRISTTCS: 
. . - ( A ) LENffnt 12 base pairs 

( B )TYPE: undeic add 

- - ■ ; ( C )srreANDEDN^ 

- . . . . V ( D )T0IO1.00y: liaw . . • 
(II ) MOLECULE TWErDXA (probe) 
( X I ) SEQUENCE DESaUFTION: SEQ ID NO:l33: 

AGGATGAGGC AG 



1 2 



( 2 ) INFORMAnON FOR SEQ ID Kai34: ' \ . ^ 

( i ) jSEQUENCE CHARACIBRISnCS: 

( A ) LEKCTni: 13 base pairs ~ 
< B ) TYPE: aodeic add 
( C ) STRANI^NESS: sh^Ie 
( D) TOPOLOGY: liaeai 

( ! I ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPnON: SEQ IDNO:134: 

AAATAATAGG ATO 



1 3 



( 2 ) INFORMATION FOR SEQ ID NO:U5: 

( I ) SEQUENCE CHARACTCRISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE- BDddc add 
, ( C ) STRANDEDNESS: slofle . 
. ( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPHON: SEQ ID N0:13S: 

GCGATAAATA AT 



1 2 



( 2 ) INFORMATION FOR SEQ ID NOrUfi: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pain 
( B )TYP&i»de!cadd 
( C ) SrntANDEDNESS: sfngle 
( D ) TOPOLOGY: Unear 

( I I ) MOLECULE TYTE: DNA (probe) 

( X 1 ) SEQUENCE DHSOOFnC^i: SEQ ID NO:L36: 

TAGGATGCGA TA 



1 2 



( 2 ) INFORMAnON FOR SEQ ID KO:137: 

< I ) SEQUENCE CHARACiHUSnCS: 
( A ) LENGTH: 12 base pairs 
( B )TirP£Bacleicadd 
( C ) STRANDEDNESS: sh^e 
( D ) TOPOLOGY, linear 

( 1 1 ) MOLECULE TirPe:DXA([*<are) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID Nai37: 

GTAGGATGCG AT 



1 2 



( 2 ) ENFORMAnON FOR SEQ ID NO:138: 
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( I ) SEQUENCE CHARACIERISrnCS: ' * 

( A ) LENGTR- 12 base pairs 

( B ), TYPE: nucleic ecid , , - . • - 

( C )STRANDEDNESS: single . " V ■ > ^ 

. . ( D) TOPOLOGY: linear .. - ' . 

( I i ) MOLECULE TYPE: DNA (probe) . . ■ 

( X I ) SEQUENCE DESCRIPTION: SEQ ro N0:138: / 
TTG'AACGTAG GA ■ ■ ' ' 'y--'^- ' ' " Z':. -V j j 

( 2 )INFORMAnONFORSEOIDNai3^ - 

( J ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 13 base pairs . 

. ( B )T5fPE: nucleic acid -! _ . . , _ . _ __ 

( C ) CTRANDEDNESS: single ' " \ . : : i'' . .r / ; ■ \ - . 

( D ) TOPOLOGYi linear . ' 

(I I ) MOLECULE TYPE: DXA(piobe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ED NO:139: . ^ 

AATATTGAAC GTA ' 13 



( 2 ) INFORMAnON FOR SEQ D> Nai4a 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: Ubasepairs 
( B }TYPE:niiclcIcacid 
( C } STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( i i )MOLECULETYPE:DNA (probe) . . 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:140: . . .. .. . 

GCCTGTAATA TTG 13 



( 2 ) INFORMAnON FOR SEQ ID N0:141: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( D ) TYPE: nadcic actd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 > MOLECULE TYPE: DNA^vobe) 

( X 1 ) SEQUENCE DESCRDTION: SEQ ID N0:141: 

TGTTCGCCTG TA 12 



( 2 ) INFORMAnON FOR SEQ ID NO:142: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nndetc acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA(iir6be) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:142: 



( 2 ) INFORMAnON FOR SEQ ID NO:143: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nodeic acid 
( C ) HANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYRE: DNA (probe) ^ . 
( X i ) SEQUEKCE iJESCRimON: SEQ ID NO:143: 

CTCCCGTOAG TO ? : 



1 2 



( 2 ) INFORMATION FOR SEQ ID NO:144: — . 

: " - ^ I ,j sEQUENCE^ciWRAa^ 

( A ) UENC7IH: 12 base pairs 
';( B ) TYPE: nndelc acid 

( C ) SnCANDEDNESS: stngte 
. ( D) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE OESCROTION: SEQ ID NO:144: 

GAGAOCTCCC OT 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:145: . 

(I ) SEQUENCE CHARACIERISnCS: 
( A ) lENGIH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) SntANDEDNESS: sm^le 
( D )T0FOLOG^ Umax 

( 1 i ) MOLECULE TYPE: DNA (pcobc) 

( X I ) SEQUENCE DESCRimON: SEQ ID NO:l45: 

ATOGAOAOCT CC 



1 2 



2 ) INFORMAnON FOB SEQ ID N0:14«: - - - 

( I ) SEQUENCE CHARACTERISrnCS: 
( A )LENGTH: 12 base pairs 
( B ) TYPE: DDcIelc acid 
( C ) STHANDEDNESS: single 
( D )TOF0LOGY: linear 

( i 1 )M0LECU1£ TYPE: DNA (probe) 

( K I ) SEQUENCE DESCRIFTION: SEQ ID N0:146: 

AATGCATOGA GA 



1 2 



( 2 ) INFORMAnON FOR SCQ ID N0:147: 

( 1 ) SEQUENS CKARACTERISnCS: 
( A)l£NGTH: 12 base pairs 
( B )TYPE:inKle{ciicId 
( C ) SntANDEDNESS: single 
( D)TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA (probe) 

( X I •) SEQUENCE DESCRIPTION: SEQ ID NO:147: 

ATACCAAATG CA 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:148: 

( I ) SEQUENCE CUARACTERISmCS: 
( A ) LENGTH: U base pairs 
( B )TYPE: nactdc 
( C ) STRANDEDNESS: single 
( D )T0POLOGY: linear 

( i I ) MOLECULE TY^ DNA Oaobe) 

( X 1 ) SEQUENCE DESCRIPTTON: SEQ ID NO:I48: 
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GACGAAAATA CCA; * . ; /; ' ' * 

( 2 )INFORMAnONFORSEQn)NO:149: *-:.:V - ' :^ • 

( i ) SEQUENCE CHARACIERISnCS: ; - . ' 

-; ■':■/;;■■'.( A )iEKcaH: 11 bascpaus /V^^'^- ■■-■'V. ■ ' - 

( B )TVI£nvcI«cadd 

- . V ( C ) STOANDEDNESS: sfaglc ' • V . ' 

( D )TOil«LOGnft linear ' - - >-... 'i: ^ ^j' v; 

( 1 i)MOLECmJETlfPE:DNA (probe) 

( X i )SEQUENCEDESCRIFnON:SEQIDNO:149: . - - . 

CCCAGACGAA A 



( 2 ) INFORMAnON FOR SEQ ID NoilSa - f 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 11 base pairs - 
( B )Tyi& nucleic add 
( C ) SntANDEDNESS: single 
(D)TOIOIXX^ linear 

( i I ) MOLECULE TYTEiDNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:1SO: 

TACCCCCCAG A 



( 2 ) INFORMATION FOR SEQ ID NO:lSl: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LEKOIH: U base pairs 
( B )TirFE:nadeicadd 
( C )SntANDEDNESS: single 
( D )TOroLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA(pobe) 

( X i ) SEQUENCE DESOUPTION: SEQ ID NO:151: 

TGCATACCCCC 



( 2 ) INFORMATION FOR SEQ D>KO:I52: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LEKGIH: 12 base pairs 
( B )Tlfre:ftodeicacid 
( C ) SIRANDEDNESS: single 
( D)1OK)L0G\i linear 

( I 1 ) MOLECULE lYTE-DNA (probe) 

( X I ) SEQUENCE DESCRIPIION: SEQ ID NO:152: 

TCCCGTGCAT AC 



( 2 ) INFORMAnON FOR SEQ ID N0:151: 

( 1 ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B )Tnd»cIeIc Bcid 
( C ) SIRANDEDNESS: abiele 
( D ) TOPOLOGY: Ifaear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE OESaumON: SEQ fD KO:153: 

GACTATCOCG TG 



( 2 ) INFORMAnON FOR SEQ ID N0:1S4: 
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( i ) SEQUENCE CHARACIERISnCS: ' . ' 
.( A ) LENGTH: 12 base pairs 
. ; ( B )TYP£: nodeicacid . . 

V . ^ C ) STRANDEDNESS: single j ' . ■ - . 

... ( D)TOPOLOGy:Uncar . , _ - • 

. ( i i ) Mpl£CULETYP£ DNA (probe) - . • ^ • . 

( X I ) SEQUENCE DESCRIFnON: SEQ ID N0:154: - . ' 



( 2 ) INFORMAnON FOR SEQ ro NO:155: * 

(I ) SEQUENCE CHARACIERISnCS: 

( A ) LENGTH: 12 base pairs . 

. ( B )TyPE: nucleic acid , _ 

' • ' i C)SrrRANDEDNESS:»miIe - 

( D)T0POLOGY linear 

( ! i )MOLECULETYPEiDNA(pfobe) 

( X i ) SEQUENCE DESCRTPnON: SEQ ID N0:15S: 

CTCGCAATGA CT 



( 2 ) INFORMAnON FOR SEQ ID N0:156: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 baw pairs 
( B )lYP£:nucIcic8cid 
( C ) STRANDEDNESS: single 
( D)T0P0L0GY: linear 

(II ) MOLECULE TYPE: DNA (probe) . 

. ( X I ) SEQUENCE DESGRTFnON: SEQ ID NO:156: 

CGTCTCGCAA TO 



( 2 ) INFORMAnON FOR SEQ D> N0:U7: 

( 1 ) SEQUENCE CHARACTCRISnCS: 
( A ) LENGIK: U base pairs 
( B )TYP£:nQdeicac1d 
( C ) STRANDEDNCSS: s'mglc 
( D)T0POLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRffTTON: SEQ ID N0:157: 

CTCCAGCGTC TC 



( 2 ) INFORMAnON FOR SEQ ID KOilSSi 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: noddc acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I ! ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPnON: SEQ ID N0:158: 

TCCGGCTCCA C 



( 2 ) INFORMAnON FOR SEQ ID NO:L59: 

( I ) SEQUENCE CHARACIERISnCS: 
* ( A ) LENGTH: tl base pairs 
( B )'nrPE nDctetcac!d 
( C ) SmANDEDNESS: single 
(D)T0FOLOGY:liQeai 



12 
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( i i ) MOLECULE mE; DNA (iptobe) 
( X . i ) SEQUENCE DESCRIFnON: SEQ ID NO:159: 
GTOCTC COGC ; T , " T 11 



( 2 ) INFORM/mON FOR SEQ ID NO:160: - 

: ( ! ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
r - - . - ( B ) TYPE: nucleic acid 

- ( C ) STRANDEDNESS: single 
( D )TOPOLOGy: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

: : ( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:160: 

GACCCTGAAG TAG„ : ; . 13 



( 2 ) INFORMAnON FOR SEQ ID N0:16l: 

( I ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 13 base pairs 
( B ) TYPE: onctelc actd 
( C ) mAKDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:16l: 

TTTATGACCC TGA 



( 2 ) INFORMATION FOR SEQ ID NO:162: _. _ . 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:162: 

TTTAGOCTTT ATG 



( 2 ) INFORMAnON FOR SEQ ID NO:163: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic ocid 
< C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TVPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:163: 

GCTATTTAGO CT 12 



( 2 ) INFORMAnON FOR SEQ ID NO:164: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH; 12 base pain 
( B ) TYPE: nucleic acid 
< C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 




13 
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( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:164: 
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TGGGCTATTT AG 



1 2 



( 2 ) INFOEIMATION FOR SEQ ID NO:165: ' 

(I ) SEQUENCE CHARACTERISTICS: 
. :'r ( A ) LENGTH: 12 base pairs . : ; 

( B ) TYPE: DDcIcic add 
( C ) STRANDEDNESS: single 
. . ( D ) TOPOLOGY: linear ^ ^ . . . . , 

( i i )MOLECUUE TYPE: DNA (probe) , 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:165: 

ACGTCTGGGC TA 



I 2 



( 2 ) INFORMAnON FOR SEQ ID n6:166: ' „ V . . 

. . , . ( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 12 base pairs 
( B ) TYPE: Eodeic acid 
( C ) STRANDEDNESS: single 
( D)TOPOLOGy; linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:166: 

AGGGGAACGT GT 

( 2 ) INFORMATION FOR SEQ ID NO:l67: 

( I ) SEQUENCE CHARACTCRISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: oodeic add 
. „ - ( C )STRANDEDNESS:«ingle 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:167: 

TTTAACGGGA AC 

( 2 ) INFORMAnON FOR SEQ ID Nai68: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: Ubase pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D } TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:168: 

ATGTCTTATT TAAG 

( 2 ) INFORMAnON FOR SEQ ID N0:169: 

( I ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nndeic acid 
( C ) STRANDEDNESS: shgle 
( D)T0P0LOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID Nai69: 

CATCGTGATO TCT 

( 2 ) INFORMAnON FOR SEQ ID N0:17a 



1 2 



1 2 



1 4 



1 3 
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( i ) SEQUENCE CHARACIERISrnCS: 
( A ) LENGTH: 12 base pairs 
V ( B ) TYEE: nocleic acid 
' ( C ) STRANDEDNESS: single . 
( D )TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:170: 

TCCATCGTGA TG - 



( 2 ) INFORMAnON FOR SEQ TD N0:171: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCrlH: 12 base pairs 

( B ) TYPE: noclcic acid 

( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) . 

( X I ) SEQUENCE DESCRIFHON: SEQ ID N0:171: 

GATGATCCAT CG 



( 2 ) INFORMATION FOR SEQ ID NO:172: 

< 1 ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: Dudeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:172: 

AGACCTGATG ATC 



( 2 ) INFORMAnON FOR SEQ ID NO:173: 

( I ) SEQUENCE CHARACIERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nndeic acid 
( C ) STRANDEDNESS: single 
( D )TOFOLOGY: Imear 

( I i ) MOLECULE DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:173: 

GGGTCATAGA CCT 



( 2 ) INFORMAnON FOR SEQ ID NO:l74i 

( I ) SEQUENCE CHARACIERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: Qodeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:l74: 

ATAGGGTGAT AGA 



( 2 ) INFORMAnON FOR SEQ ID Nai7S: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: BDcleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



101 
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. ( i I ) MOLECULE TVPEDNAOiobe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:175: 

't ' • ' ■ 

T GOTTA A TAG GO. 



i 2 



( 2 ) INFORMAnON FOR SEQ ID NO:176: 

.( | ) SEQUENCE OURACTERISnCS: 
/( A ) lENCFIH: 13 base pain 
^ ^ . - ^ ( B )TyPE:i«iddcQ^ 

' ( C ) STRANDEDNESS: single^ 
( D )TOKHi>GY:lmear 

( i I ) MOLECULE TyPE:DNA(pobe) . 

' ( X i ) SEQUENCT DESai^ 

GTGAGTGGTT AAT . 



1 3 



( 2 ) INFORM/OION FOR SEQ ID NOtlTT: 

( 1 ) SEQUENCE CHARACTERISnCS: 
( A ) LEKCTIH: 12 base pabs 
( B )T)rFE:iioclek»c)(l 
< C ) STRANDEDNESS: single 
( D )TOFOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFHON: SEQ ID NO:177: 

TGTGCGGGAT AT 



1 2 



( 2 ) INFORMATION FOR SEQ ID N0:17& , ' , 

( I ) SEQUENCE CHARACIERCnCS: 
( A ) LENGTH: 12 base paiis 
( B ) TYPE: nodeic octd 
( C ) SmtANDEDNESS: s'tngte 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:178: 

ACTCTTGTGC CO 

( 2 ) C^FORMAnON FOR SEQ ID NO:179i 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 13 base pain 
( B ) TVFE aodclc weSd 
( C ) SIRANDEDNESS: single 
( D ) TOPOLOGY: liaear 

( 1 I ) MOLECULE lYF^ DNA (probe) 

( X I ) SEQUENCE DESCBIFnON: SEQ ID N0:179: 

TAOCACTCTT GTG 



1 2 



1 3 



( 2 )INF0RMAn0NF0RSEQIDN0:18ft 

( I ) SEQUENCE CHARACrcRISnCS: 
( A ) LENGTH: 13 base pain 
( B )Tf¥Eziiodacaad 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TtrP& DNA Onbe) 

( X I ) SEQUENCE DESCSIPnON: SEQ ID NO:I80: ' 



5,837,832 ■ . 
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GGAGAGTAGC ACT ' , 13 

( 2 ) I>fF0RMAnON FOR SEQ ID N0:181: ! : " 

( i ) SEQUENCE CHARACTERISTICS: ' 

" A ) LENGTH; 12bascpairs ^ : 7 _ " . / - /■ . . ' " ' -. ■ '■ , . 

( B ) TYPE: nucleic acid 

, < C ) STOANDEDNESS: single . * 

. (D)TOP0LOGY:ljncar . . • . ^ ^, Z"^.-^. 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ n> N0:181: 

GCGAGGAGAG TA 12 



( 2 ) INFORMAnON FOR SEQ ID N0:182: ' ... / _ . \ . . ; . 

( i )SEQUENCE CHARACTERISTICS: ; . 

( A ) LENGTH: 11 base pairs ~ 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (piobe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID N0:182: 

CGCAGCGAGG A 11 

( 2 ) INFORMAnON FOR SEQ ID N0:183: 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 10 base pair* 
( B ) TYPE: nucleic acid 

( C ) STRANDEDNESS: single -L -. . .■*.... 

< D )TOPOLOGY: linear 

C I J ) MOLECULE TYPE: DNA (probe) 
( X I ) SEQUENCE DESCRIPnON: SEQ ID N0:183: 
CGCCCCGAGC 1q 

( 2 ) INFORMATION FOR SEQ ID NO:184: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: sbgle 
( D )TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRimON: SEQ ID N0:184: ' 

TTATGGGCCC G 11 

( 2 ) INFORMAnON FOR SEQ ID N0:185: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D )T0P0L0GY: Imeai 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:18S: 

ACTGTTATGG OC 12 



( 2 ) INFORMAnON FOR SEQ ID N0:186: 
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. ( i ) SEQUENCE CHARACTERrsnCS: ' 

( A ) LENCiTH: 12 base pairs \ . " . 

; > . ■ - / ( B ) TYPE: nucleic acid ; ' * \ . ' 

. : . ( C ) SniA>rDEDNESS: single - . , 

' : ( D ) TOPOLOGY: linear ' : . ; . 

' ( i i ) MOLECULE TYPE: DNA (probe) ' - 

( X i ) SEQUENCE DESCRIFnON: SEQroNO:186: 

■TACCCCCAAO ^TG "-T'^'' '-'''^ ''^""'"^^^^-"^ "■ V 



< 2 ,) INFORMAnON FOR SEQ ID NO:187: - 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pahs 

( B.)TYPE: nucleic acid 

^i: , " ( C ) STRANDEDNESS: single * 

< D )TOPOLOGY: linear 

... ( i i ) MOLECULETYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:l87: 

TTTAGCTACC. CC 



( 2 ) INFORMAnON FOR SEQ CD N0:18& 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

: ■ . ( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:188: 

TTCACTTTAG CTA 



( 2 ) INFORMAnON FOR SEQ ID N0:189: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
< D) TOPOLOGY: linear 

( 1 I ) MOLECULE TYPE: DNA (pfobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:189: 

TACAGTTCAC TTT 



( 2 ) INFORMATION FOR SEQ ID NaiPOt 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nnclctc acid 
( C ) ffniANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ED NO:190: 

TCCAGATACA CTT 



( 2 ) INFORMAnON FOR SEQ ED N0:19l: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENC7TH: 13 base pairs 
( B ) TYPE: nudele add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i j )M0LECU1£ TYPE: DNA (probe) ; . ; 

( X OSEQUENCE DESCRIPTrON: SEQrDNO:191: , ■ , ' .. 

CAOATGTCGA /GAT ; ^' 1 ; • ' ' 



( 2 ) INFORM>aiON FOR SEQ ID N0:192: , / 

- . '^ T.T (. i ) JSEQIIH«:E CHARACHERlhlCSir^V^^'' '"'".-^ " -'^ 
( A" ) LENGTH: 12 base pairs 
( B ) TYPE: nocldc acid * 
( C ).SniANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULETYPE: D?fA(probe) 

-( "x i ) SEQUENCE DESCRIPTION: SEQ ID NO:192: 

AGGAACCAGA TG 



( 2 ) INFORMAnON FOR SEQ ID N0:193: 

< i ) SEQtTENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nocletc acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:193: 

GAAGTAGGAA CCA 

■ ■ ■ ' ■ "■ p 

( 2 ) INFORMAnON FOR SEQ ID NO:194: ; 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: X3 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID N0:194: 

GACTGTAATO TGC 



( 2 ) INFORMAnON FOR SEQ ID NO:195: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) SriRANDEDNESS: single 
( p )TOPOLOGY: linear 

• ( i i ) MOLECULE TYPE; DNA (probe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:195: 

GGOATTTGAC TOT 



( 2 ) INFORMAnON FOR SEQ ID NO:196: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pdn 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:196: 
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AGGGATTTGA CT 



1 2 



( 2 ) iNFORM>ffICW FOR SEQ ID NO:197: , ''. ; 

( i ) SEQUENCE CHARACIERISrnCS: 

v \ ( A ) lENCnt 12 base pars ; ' 
( B)TYFE:inidack!d 
( C )STRANDEDNESS: single 

( i I ) MOLECULE TYPE; DXA (probe) 



( X I )SEOUENCE 
ACGAGAAGGG AT 



?TTON:SEOn>NO:197: 



1 2 



( 2 ) INFORMAriON FOR SBQ ID >(Q:i9& . / 

(I ) SEQUENCE CHARACIBRtSnCS: 
( A ) LEKGTH: 12 base paiis 
( B )TYI>!Eftiicldcackt 
(C)SntAKDEDNESS:slQ£le \ _ 
( D)TOFOLOG%Uttear 

( 1 I )MOl£CULETne:I]XA(pobe) 

( X I ) SEQUENCE DESOUFIION: SEQ ID N0:19S: 

TGGGGACGAC AA 



1 2 



( 2 ) INFORMATION FOR SEQ ID Nat99: 

( I ) SEQUENCE CHARACTERISTICS: 
( A}LENGIH:12basepain 
V (B)TYFE:ftDcleic«cjil 

(C) SntANDEDNESS: single . , 

(D) T0FOLOGY:tkeai 

( I I ) MOLECULE TYrE:DXA (probe) 
( X 1 ) SEQUENCE DESOUmON: SEQ ID NO:199: 
ATCCATGGGG AC 



1 2 



( 2 ) INFORMATION FOR SEQ n>Na2Dft 

( I ) SEQ!U£NC£ CHARACIERmiCS: 
( A)LENGTH:Ubascpaln 
( B )TYFE:DocIek»cId 
( C ) SntAKDEDNESS: single 
(D)TOFOLOGY^Knear 

( i 1 ) MOLECULE Tyre; DNAdvobe) 

( X i ) SEQUENCE DESCRIFTIQN: SEQ ED NO-^ 

GGTCATCCAT CO 



1 2 



( 2 ) INFORMAnON FOR SEQ [0X0:201: 



( I ) 



I CHARACILRISIICS: 
( A ) LEKCIH: 11 base pain 
( B)TV7E:ttodeksdd 
( C ) STRANDEDNESS: shgle 
( D)T0roLOG1&Iae3i 



( I 1 )M{M£CULETyPE:D\A^xobe) 
( X I ) SEQUENCE DESOUFIION: SEQ ID NO-^1: 
AGGGGGGTCA T 



1 1 



( 2 ) INFORMATION FOR SBQ n>Ka2Q2 



r 
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_ ( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pin 
< B )TYPE:i»ideicaeid 
, ( C ) STTRANDEDNESS: single \ 
( D )TOPOLOGY:Uncai ; . 

( i i ) MOLECULE TYPE: DXA (probe) - ^ 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:202: 

■tatctgaggg 'gg"'^ Vi'^^^v: 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:203: : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base paira 
( B ) TYPE: nndoc add 
( C ) STRANDBDNESS: sin^e 
( D ) TOPOLOGY: linear 

.( I i ) MOLECULE TYPE: DNA (probe) ^ 7 

( X 1 ) SEQUENCE DESOMPTION: SEQ ID NO:203: 

ACCCCTATCT OA 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:204: 

( i ) SEQUENCE CHARACirailSrnCS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I )MOLECULE TYPE: DNA (probe) 

. ( X i )SEQUENCE DESCRIPTION: SEQ ID NO:204: 

AGGGACCCCT A 



1 1 



( 2 ) INFORMAnON FOR SEQ ID NO:205: 

( i ) SEQUENCE CHARACIERtCTlCS: 
( A ) LENGIH: 12 base pahs 
( B ) TYPE: Bockic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i r ) MOLECULE TYPE: DNA (pcobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:205: 

TGGTCAAGGG AC 



1 2 



( 2 ) INFORMAnON FOR SEQ CD NO-.206: 

< i ) SEQUENCE CHARACIERISnCS: 
( A )LENCTH:12basepa!is 
( B ) TYPE: DDdeic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: tmear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:206: 

GGATCGTCGT CA 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:207: 

( i ) SEQUENCE CHARACTERimCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nodek acid 
( C ) ^ntANDEDNESS: sh^e 
( D ) TOPOLOGY: linear 



5,837,832 
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. . ( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:207: / - 
AGGATGOTGG TC ; ; ■ ' . 



( 2 ) INFORMAnON FOR SEQ ED NO:208: 

( I ) SEQUENCE CHARACTERISTICS: ^ ' ' ' ' ' ' ' 
( A ) LENGTH: 12 base pairs 
( B ) TVTEi nucleIc odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) . 

. ( X i ) SEQUENCE DESCRIPnON: SEQ ID NOiioS: - 

ACACOGAGGA TG 

( 2 ) INFORMAnON FOR SEQ ID NO:209: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE* nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( 1 i )MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:209: 

TGATTTACAC GG 



( 2 ) INFORMAnON FOR SEQ ID NO:210: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0:2it): 

GGGATATTGA TTT 



( 2 ) INFORMAnON FOR SEQ ID N0:21l: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: abgle 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:21t: 

GTOGCATTTO OA 



( 2 ) INFORMAnON FOR SEQ ID Na212: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B )TYPE:nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ED NO:212: 
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AGGGCTGGCA T 



1 1 



( 2 ) INFORM>aT0NFORSEOIDN0i2li , 7 . 

( i ) SEQUENCE CHARACTERISTICS: ; 
; " ; (A)lJEKCIH:Ubascpai» . 

( B )TyPE:Ei»cIeicacid 
■ : . ( C ) SniANDEDNESS: tingle * 
■ . , ; ( D ) TOPOLOGY: linear . ■ . - 

. ( I i ) MOLECULE TYPE: DNA<p«*e) ^ 

( X i ) SEQUENCE DESCRIFnON: SEQ ID N0:2U: 

GGTGAGGGGT G 



1 1 



( 2 ) INFORMAnON FOR SEQ ID VOaiAt - ' 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGIH: 12 base jain 
( B ) TYPE nodeic acid 
( C ) SIItANDEDNESS: smgtc 
(D)TOFOLOCY: linear 

(11 )MOUECULE TYPE: DXA (probe) 

( X i ) SEQUENCE DESCRUTION: SEQ ID Na214: 

AGTGGCTGAG GG 



1 2 



( 2 ) INFORMATION FOR SEQ ID Na:21S: 

( I ) SEQUENCE CHARACIERISnCS: 
( A)L£KCIK:13basepa]n 
( B ) TYPE- nndcic acid 
( C ) CTRANDEDNESS: iingic 
( b )T0f6lOGY: linear 

( I i ) MOLECULE TYPE: DXA (probe) 

( X i ) SEQUENCE DESCRIFHON: SEQ ID N0:21S: 

GTATCCTAGT GGG 



1 3 



( 2 ) INFORMATION FOR SEQ ID N0:21fi: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) UENGIK: 13 base pain 
( B ) TYPE: nnctelc acM 
( C ) STRANDE0NESS: skgle 
( D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:216: 

TTTGTTGGTA TCC 



1 3 



( 2 ) INFORMAnON FOR SEQ ID N0-J1>. 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: U base pabs 
( B ) TYPE: BDddc acid 
( C ) SniANDEDNESStsfa^e 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCmniON: SEQ n>NO:217: 

GTACGTTTGT TGG 



1 3 



( 2 ) INFORMAnON FOR SEQ ID K0:2i& 



- i ^ ■ 5,837,832 
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; ( i ) SEQUENCE CHARACIERrsmCS: \ ' ' 

: ; (A)lJENCIH:l2basepaira ^ - - . 

■ : , .( B ).TYFE: nndcic add {~ • ; 

( C ) STRANDEDNESS: sin^c 
■ ( D )T0P0LOGy: linear . ' ; \ 

( i i . ) MOLECULE TYPE: DNA' (probe) •> ' - r- _ 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:218: ^ 

XTGOOTAGOTT "fG^ ■ .'■'■■^>.'>/: 



( 2 ) INFORMAnON FOR SEQ ID 

( i ) SEQUENCE CHARACTCRISncS: 
( A ) LENC7IH: 12 base pahs 

. „1 _ ( B ) TYPE: inideic Bcid . - _ _ -_ . ... 

C ) STRANDEDNESS: single V ' 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 

< X i ) SEQUENCE DESOtlFnON: SEQ ID N0:219: 

TAAGGGTGGG TA 



( 2 ) INFORMAnON FOR SEQ ID NO:220t 

( i ) SEQUENCE CHARACIERISTICS: 
( A ) LENGHL- 13 bwc pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNHSS: single 
( D ) TOPOLOGY: linear 

( ! I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESOtlFnON: SEQ n> NO:220: 

GTACTGTTAA GGG 



( 2 ) INFORMAnON FOR SEQ CD NO:221: 

( i ) SEQUENCE CHARACIERISnCS: 
(A)LEKGTH:14 base pain 
( B )TYPE nadeic>cid 
( C ) SntAKDEDNESS: single 
( D ) TOPOLOGY linear 

( i I ) MOLECULE 1YFE:DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ IDNO:22l: 

TGTACTATGT ACTC 



( 2 ) INFORMATION FOR SEQ ID NO:222: 

( I ) SEQUENCE CHAHACIERISnCS: 
( A ) LENGIH: 13 base pairs 
( B )TYPE:BvdacBctd 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIFnON: SEQ ID NO'^22: 

GGCTTTATGT ACT 



( 2 ) INFORMAnON FOR SEQ ID 140223: 

( 1 ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: naddc add 
( C ) STRANDEDNESS: shgle 
( D )T0POLOGY linear 
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,( i .i ) MOLECULE TYPE: DNA (probe) 
. : ( X i ) SEQUENCE DESdlirnON: SEQ n> NO:223: 
A A A T G G C T T T AT ■ - - ' ' „ . . 

( 2 ) INFORMAnON FOR SEQ ED NO:224: 

-' ( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
^ ( B ) TYPE: iiTiclcJc add 
( C ) SIHANDEDNESS: single 
(D)TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESGRIPTION: SEQ ID NO:224: 

GGTAAATGGC TT 



1 2 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO:225: 

( I ) SEQUENCE CHARACrERISnCS: 
( A ) LENGTH; 13 base pairs 
( B )TYPE:iiiidcic acid 
( C ) SIRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPHON: SEQ ID NO:225: 

TGTACGGTAA ATG 



1 3 



. ( 2 ) INFORMAnON FOR SEQ ID NO:226: . : . , 

( I ) SEQUENCE CHARACTERrsnCS: - 
( A ) LENCriH: 13 base pairs 
( B ) TYPE: ntteleic add 

< C ) STRANDEDNESS: single 

< D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:226: 
GTOCTAATGT ACO 

( 2 ) INFORMAnON FOR SEQ ID NO:227: ~ 

< I ) SEQUENCE CHARACTERISTICS: 
( A ) LENCIK: 13 base pairs 
( B )TYP£:codeIc Bcid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY^ linear 

( 1 1) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:227: 

TAATGTGCTA ATG 



1 3 



1 3 



( 2 ) INFORMAnON FOR SEQ ID NO:228: 

( 1 ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: mideic acid 
( C ) STRANDEDNESS: singte 
( D) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 



( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:228: 
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CATGGGGAGG. G. 



1 1 



( 2 ) INFORMAnON FOR SEC? ID NO:229: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs . 
( B ) TYPE: nnclcic acid 
, ( C ) SrntANDEDNESS: sinfile 
^ ( D ) TOPOLOGY: linear . ' 



( i I )MOLECULETYPE:DNA^obc) 
( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:229: 
TGTAAGCATG GG 



1 2 



( 2 ) INFORMAnON'FOR SEQ ED NO:23a~ ' ^ t^-^ ,- . 

( i )SEQUENCE CHARACTERISTICS: - 
( A ) LENGTH: 13 base pairs 
( B )TYPE:nBcleicacid 
( C ) STRANDEDNESS: single 
, ( D )T0POLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:230: 
» 

TTGCTTGTAA GCA 



1 3 



( 2 ) INFORMATION FOR SEQ ID NO:231: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
' ( B ) TYPE: nucleic acid 

( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:23l: 

TGTACTTGCT TGT 

( 2 ) DiFORMAOON FOR SEQ ID NO:232: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
< B ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TVTE: DNA (piobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:23Z* 

TTCCTGTACT TGC 

( 2 ) INFORMAnON FOR SEQ ED NO:233: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGIK: 12 base pairs 
( B ) TYPE: noctelc add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(II ) MOLECULE nrpE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ CD NOt233: 

GGTTCATTGC TO 



1 3 



1 3 



1 2 



( 2 ) [NFORMAnON FOR SEQ ID NO:234: 
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(I ) SEQUENCE CHARACTERISTICS: ^ 
; ^ ( A ) UENGTH: 12 base pairs 

( B ) TYPE: nucleic acid - .. . ■ 

( C ) STRANDEDNESS: aiagic 
■ ( D)T0P0LOGY: linear / ; 

( i i ) M01£aJlJB TYPE: DNA (probe) / - — 

. ( X i ) SEQUENCE DESCRUnON: SEQ n> NO:234: 

( 2 ) INFORMAnON FOR SEQ ID NO:235: 

( i )SEQUENCECHARACrERISrnCS: 
( A ) LENGrTH: 13 base pairs 

( B )TYPE:nnclcic acid , . „ L 

I C ) SntANbEDNESS: single 
< D )T0P0LOGY: linear 

; ■ 1 '. ■ ■ ■ 

( i i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:235: 
GTGATAGTTG AGO 



( 2 ) INFORMAnON FOR SEQ ID NO:236: 

( t ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 13 base pairs 
( D )TYP£: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFHON: SEQ ID NO:236: 

TTGATGTGTG ATA 



( 2 ) INFORMAnON FOR SEQ ID NO:237: 

( I ) SEQUENCE CHARACTERtSnCS: 
( A ) LENGTH: 13 base pairs 
( B )TYPE: nndcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I )MOLECUUE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIFHON: SEQ ID NO:237: 

TGCAGTTGAT GTG 



( 2 ) INFORMATION FOR SEQ ID NO-^& ) 

( 1 ) SEQUENCE CHARAClBRISnCS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE:nnclclc»cid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIITION: SEQ ID NO:238: 

TGGAGTTGCA GT 



( 2 ) INFORMAnON FOR SEQ ID Na239: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic kcid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



125 



' ( i i ) MOLECULE TOEDNA (probe) , 

( X J ) SEQUENCE bESCWFTION: SEQ ID NO;239: 
A T T T O O A O T T .. G C . ' ~ ■ 
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( 2 ) INFORMAnON FOR SEQ ID N6:24a 

^ " : ' ( i ) SEQUENCE CHARACTERKnCSr , ~ : 

( A ) LENGTH: 13 base pairs 
\'( B )TYPE:iiiicleIc add 

( C ) SrTRANMDNESS: shgle ' 

( D ) TOPOLOGY: [ineai 

f ■ .... 

( i I ) MOLECULE TVPEiDNA (probe) 
(x i ) SEQUeiCEDKawiTldN: SEQ ro^^ 
TACCGTACAA TAT 



( 2 ) INFORMAnON FOR SEQ ID NO-^l: 

( I ) SEQUENCE CHARACTERISnCS: 
< A ) LENGIH: U base pairs 
( B )Ty?&naclcIcacid 
( C ) STRAKDEDNESS: single 
( D )TOFOLOGY: linear 

( I i ) MOLECULE TlfPfi DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:241: 

TGGTACCOTA CAA 



( 2 ) INFORMATION jpOR SEQ IDKO-J42: / 

( I ) SEQUENCE CHARACIERBTICS: 
( A ) LENGnt U base pairs 
{ B ) TYPE: DDcIctc odd 
( C ) STRAKDEDNESS: single 
( D)T0POLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:242: 

TATTTATOGT ACC 



( 2 ) INFORMAnON FOR SEQ ID NO:243: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LEKGIK: 13 base pairs 
( B ) TYPE: inideic odd 
( C ) SntANDEDNESS: single 
( D ) TOPOLOCV; linear 

( I 1 ) MOLECULE TyFE:DNA([xobe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO-^43: 

GOTCAAGTAT TTA 



( 2 ) INFORMAnON FOR SEQ ID }«>'.244£ 

( I ) SEQUENCE CHARACTCRISnCS: 
( A ) LEKOnt Ubase pairs 
( B ) TYPE BBdeic add 
( C ) SniWDEDNESS: single 
( D ) TOPOLOGY: linear . 

( I I ) MOLECULE TlfPEDNA (probe) 



( X 1 ) SEQUENCE DESOOPTION: SEQ ID N0^44: 
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TACAGGTGGT CAA 



( 2 ) INFORMAnON FOR SEQ ID NO:245: 

( i ) SEQUENCE CHARACTERISTICS: 
_ ( A ) LENGTH; 13 base pain 
( B ) TYPE: nucleic acid 
( C ) STTRANDEDNESS: single 
. . ^ , ■ i : ( P ) TOPOLOGY: linear , 

(II) MOLECULE TYPE: DNA (probe) ' 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:245: 

ATGTACTACA GOT 



( 2 ) INFORMAnON FOR SEQ ID NO:246: : 

. (I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STTRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I )MOUECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:2«: 

GGTTTTTATG TAC 



( 2 ) INFORMATION FOR SEQ ID NO:247: 

( I ) SEQUENCE CHARACTERISTICS: 
( A )LENGTH: 12 basepaus 
( B )TYPE:midcicacid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

X I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:247: 

GGATTGGGTT TT 



( 2 ) INFORMATION FOR SEQ ID NO:24a: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic add 
< C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:248: 

TGTAGGATTG GG 



( 2 ) INFORMAnON FOR SEQ ID NO:249: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:249: 

GTTTTGATOT AGO 



( 2 ) INFORMATION FOR SEQ ID NO:25Q: 
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( I ) SEQUENCE CHARACreRlSnCS: 
( A )LENGTH:12bascpairs 
. - ( B )TYFE: nucleic acid 

'' , ( C ) STRANDEDNESS: single ? 
' - (D)T0p6L0Gy:linc^ - - 

. ( i i,)MOIJECUl£TYPE: DNA (fiobc) : . , : 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:250: 

'"GGGTTTTOAT O T - -^'"^ 'v''4:^^'?^ H'v"^ ' 

* ( 2 ) INFORMAnON FOR SEQ ID N0;2SL: ' ^ 

( i ) SEQUENCE CHASACTERrsnCS: 
( A ) LEKCTTH: 11 base pairs 
;„ (B)TYPE:naddcacid 

/ . - ; '''['" ( c ) srniANbEDji^: siiigie . 

( D )TOP0LOGYi imcar 
'( i i ) MOLECULETOE:DNA (probe) 
( X I ) SEQUENCE DESCRIFTION: SEQ ID NO:251: 
GGAGOGGGTT T . 

( 2 ) INFORMAnON FOR SEQ ID NO:252: 

( i ) SEQUENCE CHARACTCRBTICS: 
( A ) LEKOIH: U base pairs 
( B )TYPE:iiocIctc8^ 
( C ) STRANDEDNESS: single 
( D)TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 
( X I ) SEQUENCE DESCRDTTON: SEQ ID NO:252: 
GTCAATACTT GGG 



( 2 ) INFORMAnON FOR SB} ID KO:2S3: 



( i ) SEQUENCE CHARACIERBTICS: 
( A ) LENCmi: 13 base pairs 
( B ) TYPE: niKletc acid 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: Imcar 

( I I )MOLECULETYFE:DNA (probe) 

( X I ) SEQUENCE DESCRimON: SEQ ID NO:253: 



GGGTGAGTCA ATA 



( 2 ) INFORMATION FOR SEQ ID l<Kh2S4: 



( I ) SEQUENCE CHARACIERISnCS: 
( A ) LEKGni: 12 base pairs 
( B ) TYPE: nodcic add . 
( C ) SntANDEDNESS: akgle 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA(pr<^e) 

( X I ) SEQUENCE DESCRIPIION: SEQ ID NO-^: 



TGGGTGAGTC AA 



( 2 ) INFORMAnON FOR SEQ ID MO*^S: 

( 1 ) SEQUENCE CHARACIERiynCS: 
( A ) l£NCTH: Ubasepairs 
( D ) TVre: Dodelc add 
( C ) SntANDEDNESS: shgle 
( D )T0P0LOCT6 linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
' ( X i ) SEQUa^CE DESCRIPTION: SEp ID NO:255: 
TGTTGATOGG TG 



( 2 ) INFORMAnON FOR SEQ n> NO:25& 

i ^ ( i" ) SEQUENCE CHARACTERISTICS: i 
. ( A ) LENGTH: 12 base pairs 
..(B) TVPE: nocleic acid 

( C )SniANDEDNESSishglc ' 
( D ) TOPOLOGY: lineai 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:256: 

CGGTTGTTGA TG 



( 2 ) INFORMAnON FOR SEQ ID NO:257: 

( I ) SEQUENCE GHARACTERISTICS: 
( A ) l£NGTH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) CTRANDEDNESS: single 
( D )T0POLOGY: linear 

( 1 1 ) MOLECULE TVPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:257: 

ACATAOCGOT TO 



( 2 ) INFORMAnON FOR SEQ ID NO:258: 

< J ) SEQUENCE CHARACTERISmCS: 
( A ) LENGTH: 13bascpain 
( B ) TYPE: coclelc add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:258: 

GAAAATACAT AGO 



( 2 ) INFORMAnON FOR SEQ ID NO;2S9: 

( \ ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 basepaira 
( B )TYPE: nucleic acid 
( C ) STKANDEDNESS: smgle 
( D)TOPOLOGY: linear 

( 1 i > MOLECULE TYPE: DNA (pr«be) 

( X 1 ) SEQUENCE DESCRIFnON: SEQ ID NOi259: 

AATGTACGAA AAT 



( 2 ) INFORMAnON FOR SEQ ID Na26a 

( I ) SEQUENCE CHARACTERISnCS: 
; ( A ) LENGTH: Ubase pain 

( B )TYPE:nticteicidd 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 



( X I ) SEQUENCE DESCRtPTION: SEQ ID NO-^60i 



5,837,832 

-continued 

GCAGTAATGT ACC . . ' 



( 2 )INFORMAriONFORSE6 lDNO:261: ' 

( i ) SEQUENCE CHARACTCRISTICS: ^ *\ 
. ' ; : . '(A)LEN(JrH:12bascpaira ' 
j" ( B )TYFEtnDcteic 8cid 
■ ( C ) SIHANDEDNESS: single . 

\ _ ( D)TOPOLOGY:lmear . „^ . 

( i J )MOLECUl£ TYPE: DNA (probe) ■ . 

( X ! ) SEQUENCE DESCRIPTTON: SEQ ID NO:261: 

TGGCTGGCAG TA 



"( 2 ) IJfFORMAnON FOR SEQ ID n6:262: \" " ~ • 

( i ) SEQUENCE CHARACIERISnCS: . 

( A ) t£NCTH: 12 base pairs 
( B )TyFE:iiQcleic8cid 
( C ) STRANDEDNESS: single 
( D)T0P0LOGY:Uaeaj 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTIQN: SEQ © NO.-262: 

TCATGGTGCC TO 



( 2 ) INFORMATION FDR SEQ ID NO:263: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) lEKGTH: 13 base pain 
( B )TYPE- nucleic acid 
.( C ) SraANDEDNESS: single . 
( D)T0POLOGY: ihear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NOa63: 

ACAATATTCA TGC 



( 2 ) INFORMATION FOR SEQ ID NO:264: 

( 1 ) SEQUENCE CHARACTERISnCS: 
( A)LEKGrH:13 base pairs 
( B )TYPE:cDcIeic Add 
( C ) STRANDEDNESS: single 
< D)TC»>0LOCY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ IDNO-^: 

TAGAATCTTA OCT 



( 2 ) INFORMAnON FOR SEQ ID NO-^S: 

( i ) SEQUENCE CHARACIERISnCS: 
( A)LENGTK:13 base pairs 
( B)TYFE:nDde{e add 
( C ) SIRANDEDNESS: shgle 
( D)T0FOLOGY:lice3i 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE D£SCRI?nONr SEQ ID NO:255: 

TTTAAATTAC AAT 



( 2 ) INFORMAnON FOR SEQ ID KO:266: 
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( i ) SEQUENCE CKARACTERISnCS: 
. ( A ) LENGTH: U base paira 
, ' ( B )TTPE:BncIcic acid 

( C )'SrrRAKDEDNESS: single 
( D)TOPOLOGY: linear 

( i 1 )MOLECULETyPE:DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:266: 

GAATAAGTTT ^AAA'^'^^r". " ^'^^^ 



1 3 



( 2 ) INFORMATION FOR SEQ © NO:267: 

( i ) SEQUENCE CHAaACTERrsnCS: 
( A ) lENGIH: 13 base pairs 
( B ) TYPE: nucleic acid 
■ ■ ( C ) SmwroEDN]^: single 

( D )TOroLOGY: linear 

(11 ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:267: 

GAACAGAGAA TAA 



( 2 ) INFORM/aiON FOR SEQ ID NO:268: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LEKGIH: 13 base pain 
( B )TYPE: nocleic acid 
( C ) SrRAM)£DNESS: single 
( D)TOPOLOGV: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:26S: 

AAAGAACAGA GAA 



( 2 ) INFORMAnON FOR SEQ !D Na269: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE:ancIeicacId 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 



( I 1 ) MOLECULE TYPE: DNA (pcobe) 
( X i ) SEQUENCE DESCRUTION: SEQ ID NO;269: 
CCCATGAAAG AA 



( 2 ) INFORMAnON FOR SEQ ID KO:27Qt 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE:Bitcldc add 
( C ) SntANDEDNESS: sing;le 
( D )T0POLOCY: linear 

( I 1 ) MOLECULE TYPE: DNA (iprobe) 

( X ! ) SEQUENCE DESCRIPTION: SEQ ID NO:270: 

TTCCCCATGA AA ^2 



( 2 ) INFORMAnON FOR SEQ ID KO-^1: 

( i ) SEQUENCE CHARACreRISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: Dodeic Kid 
( C ) STRANDEDNESS: single 
< D )T0POLOGY: linear 



5,837,832 

. • -continued 



, ■( i i ) MOLECXJLE TYPE: DNA (probe) 
. ( X i -) SEQUENCE DESCRIFHON: SEQ IDNO:271: - 
ATCTGCTTCC'CC ^ , . 



( 2 ) INFORMAnON FOR SEQ ID NO:272; 

' T (I ) SEQUENCE CHARACTERISTICS: 

.( A ) LENGTH: 12 base pairs 
' . "*• ""*■'"( B ) TYPE: noctelc acid * ' ' 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) . . 

■ ; ( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:272: 

CAAATCTGCT TO 



( 2 ) INFORMAnON FOR SEQ ID NO:273: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
< B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

GGTACCCAAA TC 



( 2 ) INFORMAnON FOR SEQ ID NO:274: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:274i 

OCTOGTACCC AA 



( 2 ) INFORMAnON FOR SEQ ID NO:275: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 bose pairs 
( B ) TYPE: nadeic add 
( C ) CTRANDEDNESS: shglc 
< D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:275: 

TACTTOOGTO GT 



( 2 ) INFORMATION FOR SEQ ID Na27«: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B )TYPE:nndeic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 



( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:276: 
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TGGAAAAAGG TT 



1 2 



( 2 ) INFORMATION FOR SEO U> VOJTJi 

( i ) SEQUENCE CHARACIERISnCS: 

( A )lJB:Gni:i2basepun • 
( B}Tyi£»Klekact4 
( C )SmAND£DN£SS: single 
( D}TOFOLOGK linear 

( I i ) MOLECULE TWa UNA (probe) 

( X i ) SEQUENCE DESCBiniQN: SEO ID NO:277: 

GTCCTTGGAA AA 



1 2 



C 2 ) INFORMATION FOR SEQ ID M>278: 

( i ) SEQUENCE CHARACIERISmCS: 
( A ) LENCnt 12 base pun 
( B )TYmiiadcic8cid 
( C ) SntANDEDNESS: single 
(D)T07010GY.Utiear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DCSCRirnON: SEQ ID NO:278: 

ATTTCTCCTT GG 



1 2 



( 2 ) INFORMATION FOR SEQ ID 

( I ) SEQUENCE CHARAOERtSTZCS: 
( A ) LENGIH: n base purs 
( B )TyF&itKldcactc) 
< C ) STRAKDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE irPE: DMA (probe) 

( X I ) SEQUENCE DESCBimON: SEQ ID NO:279: 

CTCTGATTTG TCC 



1 3 



( 2 ) INFORMATION FOR SEQ ID MQc28Gb 

( i ) SEQUENCE CHARACIERrSnCS: 
( A ) LENCIH: U base pairs 
( B )TYFBaKtdcKld 
( C )STRAND£IWE5S: shgte 
( D )TOPOLOGVi linear 

( t I ) MOLECULE TYFE: UNA ^obc) 

( X i ) SEQUENCE DESGStniDN: SEQ ID NO:280: 

TTTTTCTCTG ATT 



1 3 



( 2 ) INFORMAnON FOR S^TDKOOSli 

( 1 ) SEQUENCE CHASACIERISTICSc 
( A ) LENGItt 15b3se pabs 
( B )TYr£BBclckadd 
, ( C ) SntAKDEDNESS: s&gle 
( D ) TOPOLOGY, linear 

( 1 1 ) MOLECULE nrPB UNA (pcobe) 

( X i ) SEQUENCE DESOttfTlON: SEQ ID NO:281: 

TAAACACTTT TTC 



1 3 



( 2 ) INFORMAnON FOR SEQ ID ]«Oc282£ 
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(I ) SEQUENCE CHARACTERISTICS: ^ 
' ( A ) LENGTH: 13 base pain 

; . . : " ( B ) TYPE: nodclc acid 
■ * ' ; . ( C ) SmANDEDNESS: ainglc ■ . • ' 

'\ ( D )TOPOLOGY: linear ' . / 

( i i ) MOLECULE TYPE: DNA (probe) j 

( X I ) SEQUENCE DESCRimON: SEQ ID NO:2S2: 

:;GTGGAOTTAA VAGA' V'-^'' 'vy^^.' \ "^v.-; 

. " * ' ' "** " 

( 2 ) INFORMAnON FOR SEQ ED NO:283: - , 

( i )SEQUENCECHARACrERISTTCS: ' 

( A ) LENGTH: 13 base pairs , 

. ( B)TYPE:nTideicadd ^ 

- - - ( C )CTRANDEbNESS:'sii^"e -""T - • • V "^^ 

( D )TOPOLOGY: linear ■ . 

" ( i i ) MOLECULE TYPE: DNA (probe) ' 

( X I ) SEQUENCE DESCRffTION: SEQ ID NO:2a3: 

TGGTGCAGTT AAA 



( 2 ) INFORMAnON FOR SEQ ID NO:284: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nodetc acid 
( C ) STRANDEDNESS: sttifle 
< D)TOFOLOGY: linear 

(II ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUeiCE DESCRIPTION: SEQ ID NO:284: 

TGCTAATGGT GG 



( 2 ) [NFORMAnON FOR SEQ [D Na2SS: 

< 1 ) SEQUENCE CHARAClBRISnCS: 
( A )I£NGn{: 12 base pairs 
( B)T»fPE:nndeicac*K! 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: linear 

( 1 I )MOLECULETYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:285: 

TTGGGTGCTA AT 



( 2 ) INFORMATION FOR SEQ ID NO:2S6: 

( I ) SEQUENCE CHARACIERISmCS: 
( A yi£S<n}b 12 base pairs 
( B )TYPE:iiiHie{cadd 
( C ) SnrtlANDEDNESS: I'm^Ie 
( D ) TOPOLOGY: linear 

( I 1 )MOLECULETYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:286: 

TAGCTTTCGG TO 



( 2 ) INFORMAnON FOR SH) CD NO:287: 

( 1 ) SEQUENCE CHARACTERI^CS: 
( A ) LENGTH: 12 base pairs 
( B)TYPE:Dadeleacid 
( C ) SniANDEDNESS: flngle 
(D)T0POLOGY:Uttear 
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( i i )M01ECULETYFE:DNA (probe) 

( X i ) SEQUENCE"DESaiinnbN:SEQ IDNO:2S7: : ' ; . . 

TCTTACCTTT GO - : ' 12 

( 2 )INFORMATIONFORSEQIDKO-^ 

: ' ( i )SEQUENCE'CHARACTERISrnCS: ^ ' ' ' . - . j.^- r r: - 

( A ) LENGTH: 22 base palis 
■ ' - ; ( B ) TYPEimiicIdc ecid 

( C ) STRAKDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TWE:I>NA (probe) 

( X i ^SEQUENCi DESCWtmON: SEQ m .: " 

CACTTGTGCC CTGACTTTCA AC _ 2 2 

( 2 ) INFORMAnON FOR SEQ © 

( I ) SEQUENCE CHARACIERISnCS: * . 

( A ) LENGTH: 49 base pain 
( B ) TYPE Bodcic add 
( C ) STRAKDEDNESS: sbgle 
( D )TOF(H.OGY: linear 

( i i ) MOLECULE TWEiDNA (probe) 

( X i ) SEQUENCE DESCRinTON: SEQ [D NO:289: 

ATGCAATTAA CCCTCACTAA AGGGAGACAC TTGTGCCCTG ACTTTCAAC 49 

( 2 ) mFORMAnONFORSEQn)NO:290t„ : ; _ ^ j ; , . . . . , , , 

( ! ) SEQUENCE CHARACTERISTICS: 
( A ) LENGnt 25 base pairs 
( B ) TYPE: nocletc Kid 
( C ) SntAKDEDNESS: single 
( D ) TOPOLOGY: Uiteiir 

( ! I ) MOLECULE TYPE: DXA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:290: 

GACCCTGGGC AACCAOCCCT CTCGT 25 

( 2 ) INFORMAnON FOR SEQ ID NO:29h 

( i ) SEQUENCE CHARACTERISTICS: 
( A)tENGTH:47basepa!rs 
( B ) TYPE BBdeic sctd 
( C ) STRAKDEDNESS: single 
( D )TOFOLOCY: linear 

( I i ) MOLECULE TYPE: DXA (probe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NO:291: 

TAATACGACT CACTATAOCG AOGACCCTGO OCAACCAOCC CTGTCOT 47 



( 2 ) INFORMAnON FOR SEQ ID NO£292: 

( 1 ) SEQUENCE CHARACTERESnCS: 
( A )LEKGTU:25bfts«paln 
( B ) TYPE: nacieic add 
( C ) STRAKDEDNESS: smgte 
( D ) TOPOLOGY: Uoear 

( I i ) MOLECULE TyFE:DNA (probe) 



( X t ) SEQUENCE DESCBIPTION: SEQ ID NO-^92: 
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GT AGAATTCT . GTTGACTC AG ATTGG 



2 5 



( 2 ) INFORMAnON FOR SEQ ID NO:293: ' . ' 

■ ( i ) SEQUENCE CHARACTERISTICS: 
■ ( A ) LENGTH: 27 base pairs ; V 

' ' ^ ;^-'( B )TYPE:ni]cIeicacid 

. < C ) STRANDEDNESS: single 
' ( D ) T9POLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:293: 

AAATCCATAC AATACTCCAG TATTTGC 

-( 2 ) iNk)RMAn6N ^^^^ 

( i ) SEQUENCE CHARACTERISTICS: - 

( A ) LENGTH: 27 base pairs ' " 
( B ) TYPE: nadcjc acid 
( C ) STRA>rDEDNESS: single 
. ( D )T0P0LOGY: linear . 

(II ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:294: 

GATAAGCTTG GGCCTTATCT ATTCCAT 

( 2 ) INFORMATION FOR SEQ ID NO:295: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENCrTH: 28 base pairs 
( B ) TYPE: nucleic acid 

. . ( C ) STRANDEDNESS: single - 

- ( D )TOPOLOGY: linear 

- ( I I ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:295: 

ACCCATCCAA AGCAATGCAG GTTCTTTC 

( 2 ) INFORMATION FOR SEQ ID NO:296: 

( I ) SEQUENCE CHARACIHIISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I 1 ) MOLECULE TYP& DNA (oligonucleotide) 

( X I ) SEQUENCE DESCRIPTION: SEQ H) NO:296: 

AGCCTAGCTG AA 

( 2 ) INFORMAnON FOR SEQ ID NO:297: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TTPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (oUgonucIeoiidc) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO-^97: 

TCGGATCGAC TT 



2 7 



2 7 



2 8 



1 2 



1 2 



( 2 ) INFORMAnON FOR SEQ ID N0-^& 
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(I ) SEQUENCE CH«RACIERISnCS: , ' 
( A ) L£KGI& 22 lM5e pa!ra 
. . . ( B )Ta&»clcK.iieid' •"..I. 
■.,( C )SiaaiBEnNESS: single /"^^ ^ . '-.T', - ■ - 

:\ ( D )TOHaJDC%ImcaT;'. ■ , • .-^ - " ' ■ 

: ( i i ) MOLECULE THE DKA (probe) ' - - • 

( X i )SEQUENCEnBCBIFnON:SEQIDNO:298: . 

CGGAATTAA'C ^ CCTCACTAAA- 'GG '-V^V'"''^^ ^" " ■ . 22 

( 2 )INF6RMAnONFORSB0B>lKk29fc 

( i ) SEQUENCE CfflBtACIERISTICS: 

.( A )L£l3Cr!& 22basep^ ... - 

. . J ( B )Tifl!&«»deic»dd j . ,^ . - . . . 

> ' ' ( C ) SISAKDEEWESS: wn^c ' - ^ : : ^ ~ . „ _ 

(D)TaPOU)GlGUiKar . 

( i i ) MOLECULETnRBWA^obe) ^ 

( X i )SEQUENCEIIBasnK>N:SEQn>NO:299: . . 

AATTAACCCT CACTAAACGG AG * 22 

( 2 ) INFORMAHON for SBQIDXaJOOt 

( i ) SEQUENCE CBttACIERISnCS: 
( A ) LEKdU: 22 base pain 

( C )SnUKI)E[»mSS:*ro£|e 
( D )TaKlU)GY: rtneu 

( M )MOl£CULETl?fcl)NA(pfobc) 

( X I )SEQUENCEBSSC8irnON:SEQ©NO:300: . „ 

TAATACGACT CACTATAGGC AG 

( 2 ) INFORMAHON FOR SBQIDNO£301: 

( i ) SEQUENCE CmUtACIER^CS: 
( A ) LBQ(>IH: 201n$c pui» 
( D )TnEvKteteadd 
( C ) SnuaWESmSS: smgte 
( D )TGnU)G% Imcar 

( 1 1 )MOLECULE'ni& SNA (probe) 

( X 1 ) SEQUENCE CBamiONrSBQIDNOiSOl: 

ATTTAGGTGA CACIATAGAA 2 0 

( 2 ) INFORMAHON FOR SEQI>K]t3Q2: 

( 1 ) SEQUENCE CBOACIERISnCS: 
( A ) LEKIH: 10 base palis 
( B )Tn£KlaeacU 
( C )SnUIKI]EDNESS:uB£le 
( D )TOnU)Gn& Eoeai 

( 1 I )MOLECULET1f&r9U([robe) 

( X i ) SEQUENCE UflUUliON: 5EQ ID NO;3Q2: 

GATNATATTT 



( 2 ) INFORMAnON FOR 



( 1 ) SEQUENCE CHMUCIESISnCS: 
( A ) LEH3ft 10b>se pabs 
( B )TrKnKlekadd 
( C )5nuaa£DN£SS: shgle 
( D)TOnUXn&tiiteir 



1 0 
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. ( i i ) MOLECULE TYFEzDNA (probe) ; . 
;( X i ) SEQUENOE DtSCRIFnON: SEQ ID NO:303: 
AGANGAT ATT . ^' ' > \ / 

( 2 ) INFORMAnONFORSEQn>NCh304: • 

; . r, \ • \ ( I ) SEQUENCE CHJ^ctmsmcs: y :^^ 

( A ) LENGTH: 10 base pairs 
■. ; ■ ' . - 'S : '(B) ryPE: HDdelc acid ■ ' V ■ ^. ' ' 
( C ) STOANDEDNESS: shi£lc 
( D )TOP01jOGY: Cnear 

( i i )MOLECUUETyPE:DNA (probe) , 

, ( X i ) SEQUENCE DESOlbTION: SEQ ID NO-304: 

AAGNTGATAT - 

( 2 ) INFORMAnON FOR SEQ ID NOt305: 

( I ) SEQUENCE CHARACIERISncS: 
( A )l£NGTH:10basepa!rs 
( B )TYFE:nDdcU: add 
( C ) SmtANDEDNESS: sbfle 
( D )T0t>0LOGY: linear 

( i 1 ) MOLECULE TVTEiDXA (probe) 

( X 1 ) SEQUENCE DESCfUFnON: SEQ ID NOJOS: 

AAANATOATA 

( 2 ) INFORMAnON FOR SEQ ID NO-j06e . , . /' 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 10 base pain 
( B )TYFE:Dvde{cocId 
( C ) STRANDEDNESS: ahfte 
(D)T0FOLOCY:tmear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRfmON: SEQ ID NO:306: 

CAANOATOAT 

( 2 )INFORMAnONF0RSEQIDN0J07: 

( I ) SEQUENCE CHARACIERl^CS: 
( A ) LENGIK: 10 base pairs 
( B ) TYPE: Bodelc acid 
( C ) SIRANDEI»<ESS: stn^ 
( D )T0P0LOGie liaear 

(11 ) MOLECULE nrPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFIION: SEQ ID NO-J07: 

CCANAGATGA 

( 2 ) INFORMAnON FOR SEQ m Na30& 

( I ) SEQUENCE CHAHACIERISncS: 
. ( A )LENCTH: tObasepabs 
( B )TY7BBtideicacid 
( C ) STRANDEDNESSc tHajflti 
( D)TOroL0G1CtiDcar 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCSIFIION: SEQ n>NO-30S: 



1 0 



1 0 



1 0 



1 0 



1 0 
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ACCNAAGATG 



1 0 



( 2 ) INFORMAnON FOR SEQ ED NO-J09: 

( J )SEQUENCECHARACreRlSTICS: 
; . . (A)LENGriH:10b3scpaii5 

; ' ( B )TYPE ttocIdcadd : ' 

* :( C)SrrRANDEDNESS:*mfile 

_ ■ . , ( D )TOPOI.OGy: Imear , 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRirnON: SEQ ID N0:3C»: 

CACNAAAGAT 



1 0 



( 2 ) [NFORMAnON FOR SEQ ID NOSia , _ . , 

( i ) SEQUENCE CHARACTERISrnCS: 
< A ) LENCIH: 10 base pairs 
( B )TYFE: nucleic acia 
( C ) SrniANDEDNESS: smgle 
( dJtOPOLOGV: linear 

( I i ) MOLECULE TYPE: DNA Oxobc) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:310: 

AGAAACNACA " 



1 0 



( 2 ) INFORMATION FOR SEQ ID N0:31l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 16 b»e pairs 
( B )1YP£: nticlcic acid 

..:,:.( C)SrRANDEDNESS:»!n^e : 

(D)TOPOLOGY: linear 

( I I )MOUECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIFHON: SEQ ID N0:311: 

ATTTCATTCT GTATTG 



1 6 



( 2 ) INFORMATION FOR SEQ ID NO:3t2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A )LEKarH: 16basepain 
( B )TYFE:itBCkIc acid 
( C ) 5IRANDEDNESS: cln^e 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIFnON: SEQ \D N0-J12: 

CCGACTCCAG TCGTTA 



1 6 



( 2 ) INFORMAnON FOR SEQ ID N0:313: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGIH: 15 base pain 
( B )TyPE: BQdetc add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: Uaear 

( 1 i ) MOLECULE TYPE: DNA (ipcobe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:313: 

CCGACTCCAG TCGTT 



1 S 



( 2 ) INFORMAnON FOR ^ ID N0-JI4: 
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( I ) SEQUENCE CHARACTERISTICS: • 

( A ) LEKGTK: 15 base pairs f ' 
- ( B ) TYPE: nocJeic acid . / V - ■ 

( C ) STRANDEDNESS: »inglc - ! • • - - 
' ' ( D )TOP0L0GY: linw 

( i i ) MOLECUtE TYPE: DNA (probe) / . . ' , . 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:314: . 

^■'ccGACTACAG -'t c G T T'^^^y^^^{^'K;s^'y'^ '-'^ ' '': ':' 

( 2 ) INFORNMnON FOR SEQ ID N0:315: • " - 

( i ) SEQUENCE CHARACIERISnCS: ' ' 
( A ) LENGTH: 15 base pairs . , 

„.!..„..( B) TYPE: nnclcic acid ...^ _ J . _ 

: . ~ " V ' ( C ) STRANDEDNESS: single " ' " ^- " ' % ' ^ ' " ; 

( D )TOPOLOGY: linear V ' ^ . ' 

( i i )MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ED NO:315: 

CCGACTCCAG TCGTT 



( 7 ) INFORMAnON FOR SEQ ID NO:316: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nocleic add 
. ( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i > MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ EDNO:316: 

CCGACTTCAG TCGTT 



( 2 ) INFORMAnON FOR SEQ ID NO-J17: 

( ! ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base paiis 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:317: 

GTAATTTCTT TTATAGTACA AACCACAAAC GATAC 



( 2 ) INFORMAnON FOR SEQ ID NO-JlS: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nncleic »ctd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (ollgonucleotWe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO-J18: 

CATTAAAGAA AATATCATCT TTGGTGTTTC CTATG 



( 2 ) INFORMAnON FOR SEQ ID N0-J19: 

( 1 ) SEQUENCE CHARACmUSnCS: 
( A ) LENGTH: 32 hisc pairs 
( B )TYPE:nadeIcsdd 
( C ) STRANDEDNESS: alnglc 
( D) TOPOLOGY: linear 
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(i i )M01£CUl£ TYPE:DXA(oOgonudeolKJe) ■ : ^ . 

(x i ) SEQUENCE DESCRIFnbW: SEQ ro NO:319: ' 
CATTAAAGAA AATATCATTO GTGTTTCCTA TO 32 

( 2 ) INFORMtfnON FOR SEQ ID Nam . / • ' 



■ ■ / ( i ) SEQUENCE CHARACTERISTICS:" 
( A ) LEKOnt 18 base paks 
• : ":( B )'nfFE:inieIeScadd ; * ; ^ i * ' . " , 

( C ) STRANDEMffiSS: sin^e . . - 

( D )TOPOLOGY:B»ear 

( i i ) MOLECULE TYPE* DNA(iK*e) 

( X 1 ) SEQUENCE DESCRffTION: SEQ ID Nb:320: 

CATTAAAGAA AATATCATj -^^ 

( 2 ) INFORMAnON FOR SEQ ID NO-jak 

( I ) SEQUENCE CHARACICRISTICS: 
( A ) LENGTH: 35 Ittse pairs 
( B ) TYPE: noddc add 
( C ) STRANDEDNESS: 5ini5!e 
( D )TOPOLOGT6liDear 

( i i ) MOLECULE TyPE:DNA(oCg«iTicIeotidc) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO J21: 

TATTAAAOAA AATATCATCT TTCGTOTTTC CTATC 3 5 

( 2 )INFORMAnONFORSEQIDNOJa2:. . . _. . ... - 

( I ) SEQUENCE CHARACfERlsnCS: ... 
( A ) LENGTH: 35 base pairs 
( B )TYFE:DiKleIcacid 
( C } SIKAKDEDNESS: ftinfje 
( D )T0FOL0GY^Imetf 

( i i ) MOLECULE TYFE:D^(oaeoaDdcotk!e) 

( X I ) SEQUENCE DESCRIFTION: SEQ ID N0022: 

CCTTAAAGAA AATATCATCT TTCGTGTTTC CTAAA 3 5 



( 2 ) INFORMAnON FOR SEQ ID NO:323fe 

( I ) SEQUENCE CHARACIERiniCS: 
( A ) LENGTH: 35 Inse pain 
< B )TYP£:Di>ddcadd 
( C ) SIRANDEDNESSrwngrc 
( D)TDPOLOG%liBcar 

( i i ) MOLECULE TYPE: D>U(aQs(niictedde) 

( X 1 ) SEQUENCE DESCRDOION: SEQ ID NO-J23: 

CTTTAAAGAA AATAAAAAAA TTCGTGTTTC CTAAA 3 5 



( 2 ) INFORMymON FOR SEQ ID NO:324c 

( 1 ) SEQUENCE CHARACIESISnCS: 
( A ) LENGTH: 20l»sepalis 
( B) TYPE: DDctdc add 
( C ) STRANDEDNESS: aSnste 
( D) TOPOLOGY Inev 

( i I ) MOLECULE TYPE: DNACprobe) 

( X I ) SEQUENCE DESCIOFnDN: SEQ ID NO-J24; 
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GGAAGTCTCC CATTTTAATT 



2 0 



( 2 ) INFpRM>tnON FOR SEOID NO-J25: • , ' 

. \ ( I ) SEQUENCE CHARACTERISTICS: 
; : " <A)LENCnH: 20 base pairs : . 

■ - ^ ; - ■' - :V- - .^ ( B )TnTE:nTidw»ci<J -- /I-;--^- V: ' 
. . . ( C ) STRANDEDNESS: single" 
( D )TOPOLOCY:UDeai 

(II )MOUECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRDTION: SEQ ID NOJ25: 

CCTTCAGAGG GTAAAATTAA 



2 0 



( 2 ) INFORMAnON FOR SEQ ID NOj26: " T ''^\ 

( i ) SEQUENCE CHARACIERISnCS: . 
( A ) LENGTH: 20 base pain 
( B )TY7E:aoclctcac]d 
( C ) STRANDEDNESS: aingle 
( D)TOPOLOCV.UMai 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:326: 

CCTTCAGAGK GTAAAATTAA 



2 0 



( 2 ) INFORMAnON FOR SEQ ID NOa27: 

( I ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 20 base paira 
( B )TYPE:DDdeic>c;d 
( C ) SmtANDEDNESS: single 
( D )T0P0L0GV.Iiiic3i : 

( I I ) MOLECULE TVTE: DNA (pfobe) 

( X I ) SEQUENCE DESCRDTION: SEQ ID NO:327: 

CCTTCAGAGT GTAAAATTAA 

( 2 ) INFORMATION FOR SEQ ID N0J2& 

( I ) SEQUENCE CHARACIERISTICS: 
( A ) LENGTH: 19 base pairs 
( B )TVPE:nodekKid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY, linear 

( I 1 ) MOLECULE TTFE: DNA Ovobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO-J28: 

CCTTCAGAGG GTAAAATCA 

( 2 ) INFORMATION FOR SEQ ID NO-J29: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pain 
( B ) TYPE: DDdeIc add 
( C ) STRANDEDNESS: single 
( D) TOPOLOGY: Ilneai 

(II ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO J29: 

CCTTCAGAGG CTAAAATTA 



2 0 



1 9 



1 9 



( 2 ) INFORMAnON FOR SEQ ID K0-J3a 
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• ( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 19 b*« pairs ^ 
.. ( B ) TYPE* nnclwc acid \ ■■■ . . - . 
. '^•7- ( C ) StRANDEDNESS: sinjgic ; - , : . . 

( D)T0P0LOGY: linear ' 

, ( 1 i ) MOLECULE TYP&DNA (probe) / 

, ( x i ) SEQUENCE DESCRimON: SEQ ID NO:330: 

GATTCAGAOT G T A A A AT A C ' ^ ; / • ^ 

( 2 ) INFORM^OTON FOR SEQ ro NO:331: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pairs , 

( B ) TYPE: nucleic acid : 

~ ■ - : ('c )srRiW^bEiWEss^ c ■ . ^v"'-: . 

- ( D )TOPOLOGY: linear ' • ' 

( I i ) MOLECUl£ TYPE: DNA(ptobe) / 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:33l: 
AAAAAAOAGT CTAAAATGA 

( 2 ) INFORMAHON FOR SEQ ID NO:332: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B )TYPE:ttiicUlc»d<l 
( C ) STRANDEDNESS: aingTc 
( D )TOPOLOGY: linear 

{ i i ) MOLECULE TYPE: DNA(oligoiiuclcoUtIc) . 

( X i ) SEQUENCE DESCiaPTION; SEQ ID NO:332: 

CATTAAAGAA AATAACATCA TTGGTGTTTC CTATG 



( 2 ) INFORMATION FOR SEQ ID NOJ33: 

( i ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 648 base pain 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA(oligoflucIcoUdc) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:333: 

AACAAACCTA CCCACCCTTA ACAGTACATA 

ATTACAGTCA AATCCCTTCT CGTCCCCATG 

CCACCATCCT CCGTCAAATC AATATCCCGC 

CATAACACTT GGGGGTAGCT AAAGTGAACT 

CATAAAGCCT AAATAGCCCA CACGTTCCCC 

CTATCACCCT ATTAACCACT CACGGGAGCT 

GTATGCACGC GATAGCATTO CGAOACGCTG 

TCTTTOATTC CTGCCTCATC CTATTATTTA 

ATACTTACTA AAGTGTGTTA ATTAATTAAT 

GTCTGCACAG CCACTTTCCA CACAGACATC 

CTCCCCCGCT TCTGCCCACA CCACTTAAAC 



GTACATAAAG CCATTTACCG 



GATGACCCCC CTCAGATAGG 



ACAAGAGTGC TACTCTCCTC 



GTATCCGACA TCTGGTTCCT 



TTAAATAAOA CATCACGATG 
CTCCATGCAT TTGGTATTTT 



GAGCCGGAGC ACCCTATGTC 



TCOCACCTAC GTTCAATATT 



GCTTGTAGGA CATAATAATA 
ATAACAAAAA ATTTCCACCA 



ACATCTCTGC CAAACCCC 



( 2 ) INFORMAnON FOR SEQ ID NO-J34: 
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/ ( I ) SEQUENCE CHARACTERISTICS: 
. ( A ) LENGTH: 12 base pain 

. - . ■ V < B )TYPE: nucleic acid ' * .7-, - - • .• • V ";. ^ \ ' ., 

. ,( C ) STEIANDEDNESS: single ' - : /. 1 .* . . 

; ( D )TOPOLOGY linear - ^ . - . 

- ill ) MOLECULE TYPE: DNA(piobe) ... ^ \ \. . . ^ , U .-C ; J . . - - V" 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:334: 

GATGdtGAGG''-'A'0""f^^^^)>'^ ^f^-^^^:'r?:^'-rl^y- '^^ .-"/^ • / 

< 2 ) INFORMATION FOR SEQ ID NO-J35: ' 

( I ) SEQUENCE CHARACTERISTICS: 
^ . . ( A ) LENGTH: 12 base pairs 

" ^/ .^ • ( C )STRANDEDNESS: single r r . - , V . . " • V:;"^; 'r ; - r- - ■ - 

( D )TOPOLOGY: linear . ~ / . "-^ ■ 

( i I )MOLECULE TYPE: DNA (probe) . 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO J35: 

CTCCTCCCCG GT • 



( 2 ) INFORMAnON FOR SEQ ID NO-J36: 

( I ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C } STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA(pcobc) 

; _ - ( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:33«: 

ACTCCTCCCC GG 



( 2 ) INFORMAnON FOR SEQ ID NO-J37: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nncleic acid 
( C ) STRANDEDNESS: singtc 
( D ) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID N0.337: 

GACTCCTCCC CG 



( 2 ) INFORMAnON FOR SEQ ID NO-J38: 

( I ) SEQUENCE CHARACIERISnCS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: tnclelc acid 
( C ) STRANDEDNESS: sbgte 
( D ) TOPOLOGY: lineai 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:338: 

CGACTCCTCC CC 



( 2 ) INFORMAnON FOR SEQ ID K0039: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B )TYFE:nDcleicadd 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: Ilnm 
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( i i )MOI£CULB TYPE; DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO-J39: 
ACGACTCCTC : CC 
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( 2 ) INFORM/mON FOR SEQ ID KO-J40: 

' ; ' : ( i ) SEQUENCE CHARAClERISrnCS: \ ' ■ 

( A ) 1£KGTH: 12 base pairs . 
" ( B )TYPE:iiBddc add ^ , . • 
( C isTRANDEDNESSiBingle 
( p )TOIpLOGY: Ibiear 

( j i ) MOLECULE TYPE: DNA ((robe) 

( X i ) SEQUENCE DESCRIPnON: SEQ ID NOt340: 

TACGACTCCT CC 



1 2 



( 2 ) INFORMAHON FOR SEQ ID N0-J41: 

( i ) SEQUENCE CHARACIERISnCS: 
( A ) LENGIH: 12 base pairs 
( B ) TYPE: aoddc acid 
( C ) SnUANDEDNESS: single 
( D ) TOPOLOGY: ihear 

( i i ) MOLECULE TYPE: DNA (piobc) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID N0a41: 

CTACOACTCC TC 



I 2 



( 2 ) INFORMAnON FOR SEQ ID NO'J42: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENCTK: 12 base pairs 
( B) TYPE: irodeic Kid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:342: 

TCTACGACTC CT 



1 2 



( 2 ) INFORMAHON FOR SEQ ID N0-J4>. 

( i ) SEQUENCE CHARACIERtSnCS: 
( A ) lEKGTH: 12 base palis 
( B )TYFE:niKleIcadd 
( C )STKANDEDNE$S: single 
( D)T0FOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( X ! ) SEQUENCE DESCRDTTON: SEQ ID NO:343: 

TTCTACGACT CC 



1 2 



( 2 ) INFORMAnON FOR SEQ ID NO-^M: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) tEKCTH: 12 base pairs 
( B )TYFE: DDdeic tdd 
( C ) SIRANDEDNESS: single 
( D)T0P0LOCY: linear 

( i I ) MOLECULE TYPE: DNA (prabe) 



( X I ) SEQIJENCE DESCRIFnON: SEQ ID NO-J44: 
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' ( 2 ) INFORMATION FOR ^ ID NO:34S: , \ ■ = 

( i ) SEQUENCE CHARACTERISTICS: . . 

/V- , - „ ; ( A)IJENGTH:12ba$cpair5 - - . - .1... / 

" * . ^ ( B)TVPE:ttTiclcic»cid ' " ' • ' ■ r . 

( C ) STOANDEDNESS: single 

-l- ^ ^ „( D)T0P0LO0y: linear . . ' f . ... , 

. ( i , I ) MOLECULE TYPE: DNA (probe) , ' 
^ - ( X I ) SEQUENCE DESCRIPTION: SEO ID NO:345; ' . . . . ' ' 

TATTCTACGA CT . - 

12 

7: X 2 ) INFoki^ONFORSEQ "7 " . 1." ' ' ' "V : " - ---—y;— - r-"^ .= 

: _ . ( i )SEQUENCECHARACrERISnCS: 

' ( A ) LENGTH: 12 base pahs ' / * ■ " 

( B ) TYPE: nnclcic acid 
( C ) SraANDEDNESS: single . 

( D )TOFOLOGY:Ibear , , . . 

( 1 I )MOLECULEryPE:DNA(probc) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO J46: 
CTATTCTACC AC 

1 2 

( 2 ) INFORMAnON FOR SEQ ID NO:347: 

( i ) SEQUENCE CHARACIERISnCS: 
( A)LENGTH:l2ba»epain 
< B ) TYPE- nucleic »cid 

L,. .7 • . ( C ) STOANDEDNESS: »lnglc _ „ ' 

( D )TbPOLOG^ linear . ."'7 f 77 .7L r7'77Tl!!7 ' ^ H\ ' 

( I I )MOLECULETirPE:DNA(probe) 
( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:J47: 
CCTATTCTAC OA 

1 2 



( 2 ) INFORMATION FOR SEQ ID 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENOIH: 10 base pain 

< B )TYPE: nucleic acid 

( C ) SniANDEDNESS: linglc 
( D )T0FOLOGY: lineal 

( I I )MOLECULETlfPE:DNA(piobe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO J48: 

TCCTCCCCGO 

1 0 

( 2 ) INFORMATION FOR SEQ ID NO-J49: 

( I ) SEQUENCE CHARACIERBTICS: 

< A ) LENOIH: 10 base pain 
( B )TYPE:aocleIcadd 

( C ) STRANDEDNESS: tingle 
( D ) TOPOLOGY: lineai 

( I I ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPITON: SEQ ID NOJ49: 

CTCCTCCCCG 

1 0 



( 2 ) INFORMAnON FOR SEQ ID N0-J5a 
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( i ) SEQUENCE CHARACHERISrnCS: . 

• ( A ) LENCTTK: 10 base pain . 
- . . ( B ) TYPE: niKlcic acid ;: ■ 

. ( C ) STRANDEDNESS: ainglc - 
( D )TOPOLOGY: linear 

■ ( i i ) MOLECULE TYPE; DXA (probe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:350: 

ACTCCTCCCC '"^-''V*'-- '.": ;■' * ■■ 

( 2 ) INFORMAnON FOR SEQ ID NO:351: 

( i ) SEQUENCE CHARACTERISTICS: 
., ( A ) LENGTH: 10 base pain 

„ , X B .) TYPE: niKlcic acid „ : _ _ , 

: r _ ( C ) SriRANDEDNESS: sin^c . Z. ■ . 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA O^robc) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:35l: 

GACTCCTCCC 



( 2 ) INFORMAnON FOR SEQ ID NO:352: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LBNGTH: 10 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:352: 

CGACTCCTCC 



( 2 ) INFORMAnON FOR SEQ ID NO:3S3: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 ba*e pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:353: 



ACGACTCCTC 



( 2 ) INFORMAnON FOR SEQ CD NO:3S4: 



( I ) SEQUENCE CHARACTERISTICS: 
( A )LEN(7rH: 10 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D )TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO:354: 



TACGACTCCT 



( 2 ) INFORMAnON FOR SEQ ID NO:355: 



( I ) SEQUENCE CHARACTEROTCS: 
( A ) LENGTH: 10 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



1 
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( i i )MOLECUUE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIFnON: SEQ ID NO:355: 



CTACO ACTCC 



1 0 



( 2 ) INFORMAnON FOR SEQ U) NO:356: 

( I )SEQUENCECHARAClTERCSTrCS: ' 
( A ) LENGTH: 10 base pairs 

' * . ( B ) TTPE: nodele odd 

( C ) STRANDEDNESS: single 
( D>TOK}L0GY:imca7 

( i i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:356: 

TCTACOACTC . ; 

t 

( 2 ) INFORMATION FOR SEQ ID NOJ57: 

( I ) SEQUENCE CHARACTERtSnCS: 
( A )lJeKCTH: lObasepaIrs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D)T0FOLOGY: linear 

( i i > MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIFTtON: SEQ ID NO:357: 

TTCTACGACT 

. ( 2 ) INFORMATION FOR SEQ ID NOJ58: . „ ' , 

( I ) SEQUENCE CHARACIERISTICS: 
( A ) LENGTH: lObasepalis 
( B ) TYPE: nocleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( X i ) SEQUENCE DESCRimON: SEQ ID NO:358: 

ATTCTACGAC 

( 2 ) INFORMAnON FOR SEQ ID NO:aS9: 

( I ) SEQUENCE CHARACTERISnCS: 
( A ) LENGTH: 10 base pain 
( B )TYPE:iioc|dcftdd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( X I ) SEQUENCE DESCRIPTION: SEQ ID NO J59: 

TATTCTACOA 

( 2 ) INFORMAnON FOR SEQ ID NO*J6a 

( I ) SEQUENCE CHARACIERISTICS: 
( A ) LENGTH: 184 base poira 
( B )TYP&nodeicsdd 
( C ) SntANDEDNESS: tingle 
( D)T0P0LOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (ol^anndeotide) 

( X I ) SEQUENCE DESCRimON: SEQ ID KO:360: 



1 0 
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TACTCCCCTG 


CCCTCAACAA 


OATGTTTTO'C 


CAACTGGCCA 


AGACCTGCCC 


TGTGCAGCWG 


6 0 


KGGGWWGATT 


^XCACA CCCCC 


GCCCGGCACC 


CGCGTCCGCG 


CCATGGCCAT 


CT ACAAGC AG 


12 0 


TCACAGCACA 


: TGACGGAGGW. 


WGKGAGGCGC 


TGCCCCCACC 


atgagcgcyg' 


CYCAGATAGC 


18 0 


S AYO ~ - 
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Wbclaim: . 3'-tagXAGXAACCACAA (SEQ ID. N0:13); 

1. An array of oligonucleotide probes immobilized on a . 3*-AGTAGAXACCACAAA (SEQ ID. No:14); 
solid support, said array having at least 100 probes and no ' 3'-GTAGAAXC€ACAAAG (SEQ ID. N0:15); 
more than 100,000 different oligonucleotide probes 9 to 20 3*-TAGAAAXCACAAAGG (SEQ ID. N0:16); and 
nucleotides in length occupying separate known sites in said 15 3'-AGAAACXACAAAGGA (SEQ ID, NO: 17); wherein 
array, said oligonucleotide probes comprising at least four each set comprises 4 probes, and X is_ individually A, G, 
sets of probes: (1) a first set that is exactly complementary ^ q and T for each seL , - 

to a reference sequence and comprises probes that coin- 9,' The array of claim 1, wherein said reference sequence 

pletely span the reference sequence and, relaUve to the ^ ^ sequence of a D-loop region of human mitochondrial 

reference sequence, overlap one another m sequence; and (2) 20 ona. 

three additional sets of probes, each of which is identical to ^ ^^^^ ^ ^^^^^^ ^.^ ^5 

said first set of probes but for at least one different i *-j • i .i, j -j - c , * f 

. t. i-rc * 1 ■ 1 * J • « nucleotides m length, and said array compnses a first set of 

nucleotide, which different nucleotide is located m the same , , . . . . ^ . 

position in each of the three additional sets but which is a P^^^ ^""'^^y complementary to a sequence contained m a 

different nucleotide in each set. 25 ^""^^"^ ^""^^'^ P^^^^"^ 1° of the refer- 

2. The array of claim 1, further comprising a fourth ence sequence and four additional sets of probes identical to 
additional set of probes, which fourth additional set is said first set but for position 7, relative to a 3'-end of a probe, 
identical to probes in the first set which 3'-end is covalenfly attached to the substrate, where, 

3. The array of claim 1, wherein said reference sequence for each of the four additional probe sets, a different nucle- 
is a double-stranded nucleic acid and probes complementary 30 otide is located, such that, for each probe in said first set, 
to both strands of said reference are in said , array. there is an identical probe in one of the four additional sets, 

4. The array of claim 1, wherein said probes are 12 to 17 ■ and such that the array has between 2500 and 100,000 
nucleotides in length. oligonucleotide probes. . _ : . ^ 

* 5. The array of claim 4, wherein said probes are 15 11. The array of claim 1, wherein said reference sequence 

nucleotides in length and attached by a covalent linkage to 35 is a sequence fi'om an exon of a human p53 gene, 

a site on a 3*-endofsaid probes, and said different nucleotide 12. The array of claim 11, wherein said reference 

is located at position 7, relative to the 3'-end of said probes. sequence comprises at least a 60 nucleotide contiguous 

6. The array of claim 1, wherein said reference sequence sequence from exon 6 of a p53 gene. 

is exon 10 of a CFTR gene, and said array has between 1000 13. The array of claim 11, wherein said reference 

and 100,000 oligonucleotide probes 10 to 18 nucleotides in 40 sequence is exon 5 of a p53 gene, said probes are 17 

length. nucleotides long, and said array comprises a first set of 

7. The array of claim 6, wherein said array comprises a set probes exactly complementary to said sequence and at least 
of probes comprising a specific nucleotide sequence selected three additional sets of probes, each set comprising probes 
from the group of sequences consisting of: identical to said first set but for a nucleotide at position 7, 
3'-TTTArAXTAG (SEQ ID, NO:302); 45 relative to a 3*-end of a probe, which 3'-end is covalently 
3'-TTArAGXAGA (SEQ ID, NO:303); attached to the substrate, which nucleotide is different from 
3*-TArAGTXGAA (SEQ ID. NO:304); a nucleotide at this position in a corresponding probe of said 
3*-ATAGTAXAAA (SEQ ID. NO:305); first set, 

3'-TAGTAGXAAC (SEQ ID. NO:306); 14. The array of claim 1, wherein said probes are oli- 

3»-AGTAGAXACC (SEQ ID. m:307); 50 godeoxyribonucleotides. 

3'-GTAGAAXCCA (SEQ ID. NO:308); 15. The array of claim 1, wherein said array has between 

3'-TAGAAAXCAC (SEQ ID. NO:309); and 10,000 and 100,000 probes. 

3'-AGAAACXACA (SEQ ID. NO:310); wherein each set 16. The array of claim 1, wherein the reference sequence 
comprises 4 probes, and X is individually A, G, C, and T is from a human immimodeficiency virus, 

for each set. 55 17. The array of claim 16, wherein the reference sequence 

8. The array of claim 6, wherein said group of sequences is from a reverse transcriptase gene of the human immuno- 
consisls of: deficiency virus. 

3»-T7TArAXTAGAAACC (SEQ ID. N0:9); 18. The array of claim 1, wherein said probes are immo- 

S'.ITATAGXAGAAACCA (SEQ ID. NO:10); bilized to said solid support via a linker. 
S'-TATAGTXGAAACCAC (SEQ ID. N0:11); 

3'-ArAGTAXAAACCACA(SEQID. N0:12); • » ♦ ♦ ♦ 
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